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Description 

FIELD OF THE INVENTION 

5 The present invention relates to raffinose synthetase genes and their use. 

BACKGROUND OF THE INVENTION 

Raffinose family oligosaccharides are derivatives of sucrose, which are represented by o-a-D-galactopyranosyl- 
w (1~>6) n-o^-D~glucopyranosyf-(l->2)-p-D-fluctofuranoside as the general formula, and they are designated "raffinose" 
when n = 1, "stachyose" when n - 2, "verbascose" when n = 3, and "ajugose" when n = 4. 

The greatest contents of such raffinose family oligosaccharides are found in plants, except for sucrose, and it has 
been known that they are contained not only in higher plants including gymnosperms such as pinaceous plants (e.g., 
spruce) and angiosperms such as leguminous plants (e.g., soybean, kidney bean), brassicaceous plants (e.g., rape), 
15 chenopodiaceous plants (e.g., sugar beet), malvaceous plants (e.g., cotton), and salicaceous plants (e.g., poplar), but 
also in green algae, chlorella. Thus, they occur widely in the plant kingdom similarly to sucrose. 

Raffinose family oligosaccharides play a role as reserve sugars in the storage organs or seeds of many plants or 
as translocating sugars in the phenomenon of sugar transportation between the tissues of some plants. 

Furthermore, it has been known that raffinose family oligosaccharides have an effect of giving good conditions of 
20 nterobacterial f bra, if present at a suitable amount in food. Therefore, raffinose family oligosaccharides have already 
been used as a functional food material for addition to some kinds of food and utilized in the field of specified healthful 
food. 

Raffinose family oligosaccharides having such a role and utility are produced by the raffinose oligosaccharide syn- 
thesis system beginning with sucrose in many plants. This biosynthesis system usually involves a reaction for the 
25 sequential addition of galactosyl groups from galactotinol through an a(t-»6) bond to a hydroxyl group attached to the 
carbon atom at position 6 of a D-glucose residue in a sucrose molecule. 

In the first step of this biosynthesis system, raffinose synthetase is an enzyme concerned in the reaction of raffi- 
nose production by combining a D-galactosyl group from galactotinol through an a(1->6) bond with a hydroxyl group 
attached to the carbon atom at position 6 of a D-glucose residue in a sucrose molecule. It has been suggested that this 
30 enzyme constitutes a rate limiting step in the above synthesis system, and it has been revealed that this enzyme is quite 
important in the control of biosynthesis of raffinose family oligosaccharides. 

The control of expression level or activity of raffinose synthetase in plants makes it possible to change the contents 
of raffinose family oligosaccharides in these plants. However, raffinose synthetase, although the presence of this 
nzyme itself was already confirmed in many plants by the measurement of its activity with a biochemical technique, 
35 has not yet been successfully isolated and purified as a homogeneous protein. In addition, the amino acid sequence of 
this enzyme is still unknown, and no report has been made on an attempt at beginning to isolate a gene coding for this 
enzyme. 

SUMMARY OF THE INVENTION 

40 

Under these circumstances, the present inventors have intensively studied and finally succeeded in isolating a raffi- 
nose synthetase and a gene coding for this enzyme from broad bean, thereby completing the present invention. 
Thus, the present invention provides the following: 

45 1) A raffinose synthetase gene isolated from a plant and having a nucleotide sequence coding for an amino acid 
sequence of a protein capable of producing raffinose by combining a D-galactosyl group through an a(1->6) bond 
with a hydroxyl group attached to the carbon atom at position 6 of a D-glucose residue in a sucrose molecule. 

2) The raffinose synthetase gene according to item 1 , wherein the plant is a dicotyledon. 

3) The raffinose synthetase gene according to item 2, wherein the dicotyledon is a leguminous plant. 
so 4) The raffinose synthetase gene according to item 3, wherein the leguminous plant is broad bean. 

5) A raffinose synthetase gene having a nucleotide sequence coding for protein (a) or (b) as defined below: 

(a) protein having the amino acid sequence of SEQ ID NO:1 ; 

(b) protein having an amino acid sequence derived by deletion, replacement, modification or addition of one or 
65 several amino acids in the amino acid sequence of SEQ ID NO:1 , and capable of producing raffinose by com- 
bining a D-galactosyl group through an a(1->6) bond with a hydroxyl group attached to the carbon atom at 
position 6 of a D-glucose residue in a sucrose molecule. 
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6) A raffinose synthetase gene having the nucleotide sequence f SEQ ID NO:2. 

7) The raffinose synthetase gene according to item 3, wherein the leguminous plant is soybean. 

8) A raff inos synthetase gen having a nucleotide sequence coding for protein (a) or (b) as defined below: 

s (a) protein having the amino acid sequence of SEQ ID NO:3; 

(b) protein having an amino acid sequence derived by deletion, replacement, modification or addition of one or 
several amino acids in the amino acid sequence of SEQ ID NO:3, and capable of producing raffinose by com- 
bining a D-galactosyl group through an a(1->6) bond with a hydroxyl group attached to the carbon atom at 
position 6 of a D-glucose residue in a sucrose molecule. 

w 

9) A raffinose synthetase gene having the nucleotide sequence of SEQ ID NO:4. 

10) The raffinose synthetase gene according to item 2, wherein the dicotyledon is a famiaceous plant. 

11) The raffinose synthetase gene according to Hem 10, wherein the lamiaceous plant is Japanese artichoke. 

12) A raffinose synthetase gene having a nucleotide sequence coding for the amino acid sequence of SEQ ID 
15 NO:5. 

13) A raffinose synthetase gene having the nucleotide sequence of SEQ ID NO:6. 

14) The raffinose synthetase gene according to item 1 , wherein the plant is a monocotyledon. 

15) The raffinose synthetase gene according to item 14, wherein the mono-cotyledon is a gramineous plant. 

16) The raffinose synthetase gene according to Hem 15, wherein the gramineous plant is corn. 

20 1 7) A raffinose synthetase gene having a nucleotide sequence coding for the amino acid sequence of SEQ ID 
NO:7. 

18) A raffinose synthetase gene having the nucleotide sequence of SEQ ID NO:8. 

19) A raffinose synthetase protein having amino acid sequence (a) or (b) as defined below: 

25 (a) amino acid sequence of SEQ ID NO:1 or SEQ ID NO:3; 

(b) amino acid sequence derived by deletion, replacement, modification or addition of one or several amino 

acids in the amino acid sequence of SEQ ID NO:1 or SEQ ID NO:3; 

the protein being capable of producing raffinose by combining a D-galactosyl group through an a(1->6) 

bond with a hydroxyl group attached to the carbon atom at position 6 of a D-glucose residue in a sucrose mof- 
30 ecule. 

20) A raffinose synthetase protein having the amino acid sequence of SEQ ID NO:1 or SEQ ID NO:3. 

21) A gene fragment having a partial nucleotide sequence of the raffinose synthetase gene of Hem 1 , 2, 3, 4, 7, 1 0, 
11, 14, 15or16. 

35 22) A gene fragment having a partial nucleotide sequence of the raffinose synthetase gene of Hem 5, 6, 8, 9, 12, 
13, 17 or 18. 

23) The gene fragment according to item 21 or 22, wherein the number of nucleotides is in the range of from 15 to 
50. 

24) A method for the detection of a raffinose synthetase gene or a gene fragment having a partial nucleotide 
40 sequence thereof, which comprises hybridizing a probe of the labeled gene fragment of item 21, 22 or 23 to an 

organism-derived genomic DNA or cDNA fragment; and detecting the DNA fragment bound specifically to the 
probe. 

25) A method for the detection of a raffinose synthetase gene or a gene fragment having a partial nucleotide 
sequence thereof, which comprises hybridizing a probe of the labeled gene fragment of item 21 , 22 or 23 to a plant- 

45 derived genomic DNA or cDNA fragment: and detecting the DNA fragment bound specifically to the probe. 

26) A method for the amplification of a raffinose synthetase gene or a gene fragment having a partial nucleotide 
sequence thereof, which comprises annealing a primer having a nucleotide sequence of the gene fragment of item 
21 , 22 or 23 to organism-derived genomic DNA or cDNA; and amplifying the resulting DNA fragment by polymer- 
ase chain reaction. 

so 27) A method for the amplification of a raffinose synthetase gene or a gene fragment having a partial nucleotide 
sequence thereof, which comprises annealing a primer having a nucleotide sequence of the gene fragment of item 
21 , 22 or 23 to plant-derived genomic DNA or cDNA; and amplifying the resulting DNA fragment by polymerase 
chain reaction. 

28) A method for obtaining a raffinose synthetase gene, comprising the steps of identifying a DNA fragment con- 
55 taining a raffinose synthetase gene or a gene fragment having a partial nucleotide sequence thereof by the method 

of Hem 24, 25, 26 or 27; and isolating and purifying the DNA fragment identified. 

29) A raffinose synthetase gene obtained by identifying a DNA fragm nt containing a raffinose synthetase gen or 
a gen fragment having a partial nucleotide sequence thereof by the method of item 24, 25, 26 or 27; and isolating 
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and purifying the DNA fragment id ntified. 

30) A chimera gene comprising the raff inose synthetase gene of item 1 , 2, 3, 4, 5 f 6, 7, 8, 9, 10, 1 1 , 1 2, 1 3, 1 4, 1 5, 
16, 17, 18 or 29, and a promoter linked thereto. 

31) A transfbrmant obtained by introducing the chimera gene of item 30 into a host organism. 

5 32) A plasmid comprising the raff inose synthetase gene of item 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 

17, 18, 29 or 30. 

33) A host organism transformed with the plasmid of item 32, or a cell thereof. 

34) A microorganism transformed with the plasmid of item 32. 

35) A plant transformed with the plasmid of item 32, or a cell thereof. 

10 36) A method for metabolic modification, which comprises introducing the raff inose synthetase gene of item 1 , 2, 
3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 1 7, 18, 29 or 30 into a host organism or a cell thereof, so that the content 
of raff inose family oligosaccharides in the host organism or the cell thereof is changed. 

37) A method for the production of a raffinose synthetase protein, which comprises isolating and purifying a raffi- 
nose synthetase protein from a culture obtained by cultivating the microorganism of item 34. 
is 38) An anti-raff inose synthetase antibody capable of binding to the raffinose synthetase protein of item 19 or 20. 

39) A method for the detection of a raffinose synthetase protein, which comprises treating a test protein with the 
anti-raffinose synthetase antibody of item 38; and detecting the raffinose synthetase protein by antigen-antibody 
reaction between the antibody and the raffinose synthetase protein. 

20 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the construction of plasmids used for the expression of a raffinose synthetase gene in Escherichia 
coli. pBIuescriptKS-RS is a plasmid containing the raffinose synthetase gene cloned therein. RS represents the raffi- 
nose synthetase gene, and the nucleotide sequences shown in the upper portion of this figure are those of both termi- 

25 nal portions of the raffinose synthetase gene. A partial sequence represented by small letters is a nucleotide sequence 
derived from the vector pBluescriptll KS-. Two boxed nucleotide sequences are the initiation codon (ATG) and termina- 
tion codon (TGATAA) of the raffinose synthetase gene, respectively. The recognition sites for several restriction endo- 
nuc! eases are shown above the nucleotide sequences. pGEX-RS and pTrc-RS are plasmids used for the expression of 
the raffinose synthetase gene in E. coil Ptac, Ptrc, GST, lacl q , and rrnB represent tac promoter, trc promoter, gfutath- 

30 ione-S-transferase gene, lactose repressor gene, and termination signal for the transcription of ribosomal RNA, respec- 
tively. 

Figure 2 shows the construction of expression vectors used for the expression in plants of chimera genes each hav- 
ing a raffinose synthetase gene and a promoter linked thereto. The restriction endonuclease map of the raffinose syn- 
thetase gene cloned in the plasmid pBIuescriptKS-RS is shown in the lower portion of this figure. pBI221RS and 
35 pBI221 (-) RS indicate the restriction endonuclease maps of expression vectors used for the transformation of soybean. 
35S and NOS represent 35S promoter derived from cauliflower mosaic virus and nopal ine synthetase gene terminator, 
respectively. 

Figure 3 shows the construction of expression vectors used for the expression in plants of chimera genes each hav- 
ing a raffinose synthetase gene and a promoter linked thereto. The restriction endonuclease map of the raffinose syn- 
40 thetase gene cloned in the plasmid pBIuescriptKS-RS is shown in the upper portion of this figure. pBH21RS and 
pBI121(-)RS indicate the restriction endonuclease maps of binary vectors used for the transformation of mustard. For 
the binary vector, only a region between the right border and the left border is shown. 35S, NOS and NPT represent 
35S promoter derived from cauliflower mosaic virus, nopaline synthetase gene terminator and kanamycin resistance 
gene, respectively. 

45 

DETAILED DESCRIPTION OF THE INVENTION 

The gene engineering methods described below can be carried out according to ordinary methods, for example, as 
described in "Molecular Cloning: A Laboratory Manual 2nd edition" (1989), Cold Spring Harbor Laboratory Press, ISBN 
so 0-87969-309-6; "Current Protocols In Molecular Biology" (1987), John Wiley & Sons, Inc. ISBN 0-471 -50338-X; and 
"Current Protocols In Protein Science" (1995), John Wiley & Sons, Inc. ISBN 0-471 -1 1 1 84-8. 

The term "raffinose synthetase gene" as used herein refers to a gene having a nucleotide sequence coding for the 
amino acid sequence of a protein capable of producing raffinose by combining a D-galactosyl group through an a(1^6) 
bond with a hydroxyl group attached to the carbon atom at position 6 of a D-glucose residue in a sucrose molecule 
55 (hereinafter referred to simply as the present gene), and such a gene can be prepared, for example, from plants. 

More specifically, the present gene can be prepared from dicotyledons such as leguminous plants (e.g., broad 
bean, soybean) and lamiaceous plants (e.g., Japanese artichoke) or from monocotyledons such as gramineous plants 
(e.g., corn). Specific examples of the present gene are a "raffinose synth tase gene having a nucleotide sequence cod- 
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ing for a protein having the amino acid sequence of SEQ ID NO:1"; a "raffinose synthetase gene having a nucleotide 
sequence coding for a pr t in having an amino acid sequence derived by deletion, replacement, modification or addi- 
tion of one or sev ral amino acids in the amino acid sequence of SEQ ID NO:1, and capable of producing raffinose by 
combining a D-galactosyl group through an a(1 -^6) bond with a hydroxyl group attached to the carbon atom at position 

5 6 of a D-glucose residue in a sucrose molecule"; a "raffinose synthetase gene having a nucleotide sequence coding for 
a protein having the amino acid sequence of SEQ ID NO:3 H ; a "raffinose synthetase gene having a nucleotide sequence 
coding for a protein having an amino acid sequence derived by deletion, replacement, modification or addition of one 
or several amino acids in the amino acid sequence of SEQ ID NO:3, and capable of producing raffinose by combining 
a D-galactosyl group through an a(1-»6) bond with a hydroxyl group attached to the carbon atom at position 6 of a D- 

w glucose residue in a sucrose molecule"; a "raffinose synthetase gene having a nucleotide sequence coding for the 
amino acid sequence of SEQ ID NO:5"; and a "raffinose synthetase gene having a nucleotide sequence coding for the 
amino acid sequence of SEQ ID NO:7." 

In a further embodiment, the invention relates to nucleic acid molecules of at least 15 nucleotides in length hybrid- 
izing specifically with a nucleic acid molecule as described above or with a complementary strand thereof. Specific 

is hybridization occurs preferably under stringent conditions and implies no or very little cross-hybridization with nucle- 
otide sequences encoding no or substantially different (oligopeptides. Such nucleic acid molecules may be used as 
probes and/or for the control of gene expression. Nucleic add probe technology is well known to those skilled in the art 
who will readily appreciate that such probes may vary in length. Preferred are nucleic acid probes of 17 to 50, particu- 
larly preferred 21 to 50 nucleotides in length. Of course, it may also be appropriate to use nucleic acids of up to 1 00 and 

20 more nucleotides in length. The nucleic acid probes of the invention are useful for various applications. On the one 
hand, they may be used as PCR primers for amplification of regulatory sequences according to the invention. Another 
application is the use as a hybridization probe to identify nucleic acid molecules hybridizing with a nucleic acid molecule 
of the invention by homology screening of genomic DNA libraries. Nucleic acid molecules according to this preferred 
embodiment of the invention which are complementary to a nucleic acid molecule as described above may also be used 

25 for repression of expression of a gene comprising such regulatory sequences, for example due to an antisense or triple 
helix effect or for the construction of appropriate ribozymes (see, e.g., EP-B1 0 291 533, EP-A1 0 321 201, EP-A2 0 
360 257) which specif ically cleave the (pre)-mRNA of a gene comprising a nucleic acid molecule of the invention. Selec- 
tion of appropriate target sites and corresponding ribozymes can be done as described for example in Steinecke, 
Ribozymes, Methods in Cell Biology 50, Galbraith et al. eds Academic Press, Inc. (1995), 449-460. Furthermore, the 

30 person skilled in the art is well aware that it is also possible to label such a nucleic acid probe with an appropriate 
marker for specific applications, such as for the detection of the presence of a nucleic acid molecule of the invention in 
a sample derived from an organism. 

The above described nucleic acid molecules may either be DNA or RNA or a hybrid thereof. Furthermore, said 
nucleic acid molecule may contain, for example, thioester bonds and/or nucleotide analogues, commonly used in oligo- 

35 nucleotide anti-sense approaches. Said modifications may be useful for the stabilization of the nucleic acid molecule 
against endo- and/or exonucleases in the cell. Said nucleic acid molecules may be transcribed by an appropriate vector 
containing a chimeric gene which allows for the transcription of said nucleic acid molecule in the cell. Such nucleic acid 
molecules may further contain ribozyme sequences which specifically cleave the (pre)-mRNA comprising the nucleic 
acid molecule of the invention. Furthermore, oligonucleotides can be designed which are complementary to a nucleic 

40 acid molecule of the invention (triple helix; see Lee. Nucl. Acids Res. 6 (1979), 3073; Cooney. Science 241 (1988). 456 
and Dervan, Science 251 (1991), 1360), thereby preventing transcription and the production of the encoded oligopep- 
tide. 

The present gene can be obtained, for example, by the following method. 

The tissues of a leguminous plant such as broad bean ( Vicia faba) or soybean {Glycine max) are frozen in liquid 
46 nitrogen and ground physically with a mortar or other means into finely powdered tissue debris. From the tissue debris, 
RNA is extracted by an ordinary method. Commercially available RNA extraction kits can be utilized in the extraction. 
The whole RNA is separated from the RNA extract by ethanol precipitation. From the whole RNA separated, poly-A 
tailed RNA is fractionated by an ordinary method. Commercially available oiigo-dT columns can be utilized in the frac- 
tionation. cDNA is synthesized from the fraction obtained (i.e., poly-A tailed RNA) by an ordinary method. Commercially 
so available cDNA synthesis kits can be utilized in the synthesis. 

For example, cDNA fragments of the "raffinose synthetase gene having a nucleotide sequence coding for a protein 
having the amino acid sequence of SEQ ID NO:1 " as the present gene can be obtained by PCR amplification using the 
broad bean-derived cDNA obtained above as a template and primers 1 to 3 shown in list 1 below. The primers used 
therein can be designed and synthesized on the basis of the nucleotide sequence of SEQ ID NO:2, depending upon 
S5 the purpose. For example, in order to amplify the open reading frame region of the "raffinose synthetase gene having 
a nucleotide sequence coding for a protein having the amino acid sequence of SEQ ID NO:1 ," primers 1 to 4 shown in 
list 2 below may be designed and synthesized. 

In the same manner, cDNA fragments of th "raffinose synthetase gene having a nucleotide sequence coding for 
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a protein having the amino acid sequence of SEQ ID NO:3" can be obtained by PCR amplification with the soybean- 
derived cDNA obtained above as a template and, for example, primers 4 to 6 shown in list 1 below. The primers used 
ther in can be designed and synthesized on the basis of the nucleotide sequence of SEQ ID N0:4, depending upon 
the purpose. For example, in order to amplify the open reading frame region of the "raff inose synthetase gene having 

5 a nucleotide sequence coding for a protein having the amino acid sequence of SEQ ID NO:3," primers 5 to 8 shown in 
list 2 below may be designed and synthesized. 

The amplified DNA fragments can be subcloned according to ordinary methods, for example, as described in 
"Molecular Cloning: A Laboratory Manual 2nd edition" (1989), Cold Spring Harbor Laboratory Press; and "Current Pro- 
tocols In Molecular Biology" (1987), John Wiley & Sons. Inc. ISBN 0-471 -50338-X. More specifically, cloning can be 

10 effected, for example, using a TA cloning kit (Invitrogen) and a plasmid vector such as pBluescript II (Stratagene). The 
nucleotide sequences of the DNA fragments cloned can be determined by the dideoxy terminating method, for exam- 
ple, as described by F. Sanger, S. NicWen, A.R. Coulson, Proceedings of National Academy of Science U.S.A. (1977), 
74, pp. 5463-5467. For example, ABI PRISM Dye Terminator Cycle Sequencing Ready Reaction Kit commercially avail- 
able from Perkin-Elmer may preferably be used. 


15 
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(List 1) 

Primer 1 : AATTTTCAAG CATAGCCAAG TTAACCACCT 30 mer 

Primer 2: GCTCACAAGA TAATGATGTT AGTC 24 mer 

Primer 3: ATACAAGTGA GGAACTTGAC CA 22 mer 

Primer 4: CCAAACCATA GCAAACCTAA GCAC 24 mer 

Primer 5: ACAACAGAAA AATATGACTC TTATTACT 28 mer 

Primer 6: AAAAGAGAGT CAAACATCAT AGTATC 26 mer 


25 (List 2) 


Primer 1 : ATGGCACCAC CAAGCATAAC CAAAACTGC 29 mer 

Primer 2: ATGGCACCAC CAAGCATAAC CAAAACTGCA ACCCTCCAAG ACG 43 mer 

Primer 3: TCAAAATAAA AACTGGACCA AAGAC 25 mer 

Primer 4: TCAAAATAAA AACTGGACCA AAGACAATGT 30 mer 

Primer 5: ATGGCTCCAA GCATAAGCAA AACTG 25 mer 

Primer 6: ATGGCTCCAA GCATAAGCAA AACTGTGGAA CT 32 mer 

Primer 7: TCAAAATAAA AACTCAACCA TTGAC 25 mer 

Primer 8: TCAAAATAAA AACTCAACCA TTGACAATTT TGAAGCACT 39 mer 

The term "gene fragment" as used herein refers to a gene fragment having a partial nucleotide sequence of the 
present gene (hereinafter referred to simply as the present gene fragment). For example, it may be a gene fragment 
derived from a plant and having a partial nucleotide sequence of the gene having a nucleotide sequence coding for a 
protein capable of producing raffinose by combining a D-galactosyl group through an a(1^6) bond with a hydroxyl 
group attached to the carbon atom at position 6 of a D-glucose residue in a sucrose molecule. Specific examples of the 
present gene fragment are a gene fragment having a partial nucleotide sequence of the gene having a nucleotide 
sequence coding for the amino acid sequence of SEQ ID NO:1 and a gene fragment having a partial nucleotide 
sequence of the gene having a nucleotide sequence of SEQ ID NO:2, more specifically a gene fragment having a nucle- 
otide sequence or a partial nucleotide sequence thereof, coding for any of the amino acid sequences shown in list 3 
below. 

These gene fragments can be used as probes in the hybridization method or as primers in the PCR method. For 
the primers in the PCR method, it is generally preferred that the number of nucleotides is greater from a viewpoint that 
the specificity of annealing is ensured; it is, however, also preferred that the number of nucleotides is not so great from 
viewpoints that the primers themselves are liable to have a higher structure giving possible deterioration of the anneal- 
ing efficiency and that complicated procedures are required in the purification after the synthesis. In usual cases, pre- 
ferred is a gene fragment consisting of single-stranded DNA, wherein the number of nucleotides is in the range of from 
15 to 50. 

(List 3) 

#1 Gly lie Lys Phe Met Ser lie Phe Arg Phe Lys Val Trp Trp Thr Thr His Trp Val Gly 
#2 He He Asp Lys Phe Gly Trp Cys Thr Trp Asp Ala Phe Tyr 
#3 Gly Gly Cys Pro Pro Gly Phe Val lie lie Asp Asp Gly Trp Gin 
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#4 Thr Ser Ala Gly Glu Gin Met Pro Cys Arg Leu Val Lys Tyr Glu Glu Asn 
#5 Val Tyr Val Trp His Ala Leu Cys Gly Tyr Trp Gly Gly Val Arg Pro 
#6 Thr Met Glu Asp Leu Ala Val Asp Lys lie Val Glu Asn Gly Val Gly Leu Val Pro Pro 
#7 Gly Leu His Ser His Leu Glu Ser Ala Gly lie Asp Gly Val Lys Val Asp Val lie His Leu Leu Glu 
5 #8 Gly Gly Arg Val Glu Leu Ala Arg Ala Tyr Tyr Lys Ala Leu 
#9 Val Lys Lys His Phe Lys Gly Asn Gly Val He Ala 

#1 0 Glu His Cys Asn Asp Phe Phe Leu Leu Gly Thr Glu Ala He Ser Leu Gly Arg Val Gly Asp Asp Phe Trp Cys Ser 

Asp Pro Ser Gly Asp Pro Asn Gly Thr Tyr Trp Leu Gin Gly Cys His Met Val His Cys 
#11 Ala Tyr Asn Ser Leu Trp Met Gly Asn Phe lie Gin Pro Asp Trp Asp Met Phe Gin Ser Thr His Pro Cys Ala Glu 
w Phe His Ala Ala Ser Arg Ala He Ser Gly Gly Pro lie Tyr Val Ser Asp 

#12 Leu Pro Asp Gly Ser He Leu Arg Cys 

#1 3 Ala Leu Pro Thr Arg Asp Cys Leu Phe Glu Asp Pro Leu His Asn Gly Lys Thr Met Leu Lys He Trp Asn 
#1 4 Gly Val Leu Gly Leu Phe Asn Cys Gin Gly Gly Gly Trp 
#1 5 Phe Ala Pro He Gly Leu Val Asn Met 

15 

The present gene fragment is labeled, and then used as a probe in the hybridization method and hybridized to 
organism-derived DNA, so that a DNA fragment having the probe specifically bound thereto can be detected. Thus, 
from an organism-derived gene library, a raff inose synthetase gene having a nucleotide sequence coding for the amino 
acid sequence of an enzyme capable of producing raff inose by combining a D-galactosyl group through an <x(1^6) 
20 bond with a hydroxy! group attached to the carbon atom at position 6 of a D-glucose residue in a sucrose molecule; or 
a gene fragment having a partial nucleotide sequence thereof, can be detected (hereinafter referred to simply as the 
present detection method). 

As the organism-derived DNA, for example, a cDNA library or a genomic DNA library of a desired plant can be 
used. The gene library may also be a commercially available gene library as such or a library prepared according to an 
25 ordinary library preparation method, for example, as described in "Molecular Cloning: A Laboratory Manual 2nd edition" 
(1989), Cold Spring Harbor Laboratory Press; "Current Protocols In Molecular Biology" (1987), John Wiley &Sons, Inc. 
ISBN 0-471 -50338-X. 

As the hybridization method, plaque hybridization or colony hybridization can be employed, depending upon the 
kind of vector used in the preparation of a library. More specifically, when a library to be used is constructed with a 

30 phage vector, a suitable host microorganism is mixed with the phage under infectible conditions, which is further mixed 
with a soft agar medium, and the mixture is plated on an agar medium. Thereafter, a culture is grown at 37°C until a 
plaque of an appropriate size appears. When a library to be used is constructed with a plasmid vector, the plasmid is 
transformed in a suitable host microorganism to form a transformant. The transfer mant obtained is diluted to a suitable 
concentration, and the dilution is plated on an agar medium, after which a culture is grown at 37°C until a colony of an 

35 appropriate size appears. 

In either case of the above libraries, a membrane filter is placed on the surface of the agar medium after the culti- 
vation, so that the phage or transformant is transferred to the membrane. This membrane is denatured with an alkali, 
followed by neutralization, and for example, when a nylon membrane is used, the membrane is irradiated with ultraviolet 
light, so that DNA is fixed on the membrane. This membrane is then subjected to hybridization with the present gene 

40 fragment labeled by an ordinary method as a probe. For this method, reference may be made, for example, to D.M. 
Glover ed., "DNA cloning, a practical approach" IRL PRESS (1985), ISBN 0-947946-18-7. There are various reagents 
and temperature conditions to be used in the hybridization; for example, prehybridization is carried out by the addition 
of 6 x SSC (0.9 M NaCI, 0.09 M citric acid), 0.1-1% SDS, 100 ug/ml denatured salmon sperm DNA, and incubation at 
65°C for 1 hour. The present gene fragment labeled is then added as a probe, and mixed. Hybridization is carried out 

45 at 42-68°C for 4 to 16 hours. The membrane is washed with 2 x SSC, 0.1-1% SDS, further rinsed with 0.2 x SSC, 0- 
0.1% SDS, and then dried. The membrane is analyzed, for example, by autoradiography or other techniques, to detect 
the position of the probe on the membrane and thereby detect the position of DNA having a nucleotide sequence 
homologous to that of the probe used Thus, the present gene or the present gene fragment can be detected. The clone 
corresponding to the position of DNA thus detected on the membrane is identified on the original agar medium, and the 

so positive clone is selected, so that the clone having the DNA can be isolated. The same procedures of detection are 
repeated to purify the clone having the DNA. 

Other detection methods can also be used, for example, GENE TRAPPER cDNA Positive Selection System Kit 
commercially available from Gibco BRL In this method, a single-stranded DNA library is hybridized with the present 
gene fragment biotinylated (i.e.. probe), followed by the addition of streptoavidin-bound magnet beads and mixing. From 

55 the mixture, the streptoavidin-bound magnetic beads are collected with a magnet, so that single-stranded DNA having 
a nucleotide sequence homologous to that of th probe used, which has been bound to these beads through the 
present gen fragment, biotin and streptoavidin, is collected and detected. Thus, the present gene or th present g ne 
fragment can be detected. The single-stranded DNA collected can be changed into a double-strand form by treatment 
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with a suitable DNA polymerase using a suitable oligonucleotide as a primer. 

The present detection method may also be used in the analysis of a plant. More specifically, plant genomic DNA is 
prepared according to an ordinary method, for example, as described in "Cloning and Sequence (Plant Biotechnology 
Experiment Manual)** complied under the supervision of Itaru Watanabe, edited by Masahiro Sugiura, published by 
Noson Bunka-sha, Tokyo (1 989). The plant genomic DNA is digested with several kinds of suitable restriction endonu- 
cleases, followed by electrophoresis, and the electrophoresed DNA is blotted on a filter according to an ordinary 
method. This filter is subjected to hybridization with a probe prepared from the present gene fragment by an ordinary 
method, and DNA fragments to which the probe hybridizes are detected. The DNA fragments detected are compared 
in length between different varieties of a specified plant species. The differences in length make possible the analysis 
of differences in phenotypic characteristics accompanied with the expression of raffinose family oligosaccharides 
between these varieties. Furthermore, when the DNA fragments detected by the above method are compared in length 
between the gene recombinant plant and the non-gene recombinant plant of the same variety, the former plant can be 
discriminated from the latter plant by the detection of hybridizing bands greater in number or higher in concentration for 
the former plant than for the latter plant. This method can be carried out according to the RFLP (restriction fragment 
length polymorphism) method, for example, as described in "Plant PCR Experiment Protocols" complied under the 
supervision of Kb Shtmamoto and Takuji Sasaki, published by Shujun-sha, Tokyo (1995), ISBN 4-S7962- 144-7, pp. 90- 
94. 

The PCR method using a primer having the nucleotide sequence of the present gene fragment makes it possible 
to amplify from organism-derived DNA, a raffinose synthetase gene having a nucleotide sequence coding for the amino 
acid sequence of an enzyme capable of producing raffinose by combining a D-galactosyl group through an cc(1-»6) 
bond with a hydroxyl group attached to the carbon atom at position 6 of a D-glucose residue in a sucrose molecule; or 
a gene fragment having a partial nucleotide sequence thereof (hereinafter referred to simply as the present amplifica- 
tion method). 

More specifically, for example, an oligonucleotide having 15 to 50 nucleotides in the nucleotide sequence of the 
present gene fragment at the 3 -terminus is chemically synthesized by an ordinary synthesis method. Based on the 
codon table below, showing the correspondence of amino acids encoded in nucleotide sequences, a mixed primer can 
also be synthesized so that a residue at a specified position in the primer is changed to a mixture of several bases, 
depending upon the variation of codons which can encode a certain amino acid. 


CODON TABLE 

Phe 

UUU 

Ser 

UCU 

Tyr 

UAU 

Cys 

UGU 


UUC 


UCC 


UAC 


UGC 

Leu 

UUA 


UCA 

Stop 

UAA 

Stop 

UGA 


UUG 


UCG 


UAG 

Trp 

UGG 


CUU 

Pro 

ecu 

His 

CAU 

Arg 

CGU 


cue 


CCC 


CAC 


CGC 


CUA 


CCA 

Gin 

CAA 


CGA 


CUG 


CCG 


CAG 


CGG 

lie 

AUU 

Thr 

ACU 

Asn 

AAU 

Ser 

AGU 


AUC 


ACC 


AAC 


AGC 


AUA 


ACA 

Lys 

AAA 

Arg 

AGA 

Met 

AUG 


ACG 


AAG 


AGG 

Val 

GUU 

Ala 

GCU 

Asp 

GAU 

Gly 

GGU 


GUC 


GCC 


GAC 


GGC 


GUA 


GCA 

Glu 

GAA 


GGA 


GUG 


GCG 


GAG 


GGG 


Furthermore, a base capable of forming a pair with plural kinds of bases, such as inosine, can also be used instead 
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of the above mixture of several bases. More specifically, for example, primers having nucleotid sequences as shown 
in list 4 can be used. In this context, an oligonucleotid having the same nucleotide sequence as th coding strand of 
the present gene consisting of double-stranded DNA is designated "sense primer," and an oligonucleotide having a 
nucleotide sequence complementary to the coding strand, "anti sense primer." 

5 A sense primer having the same nucleotide sequence as present on the 5'-upstream side in the coding strand of a 

raff inose synthetase gene or a gene fragment having a partial nucleotide sequence thereof to be amplified, and an anti- 
sense primer having a nucleotide sequence complementary to the nucleotide sequence on the 3'<iownstream side in 
this coding strand, are used in combination for PGR reaction to amplify a DNA fragment, for example, with a gene 
library, genomic DNA or cDNA as a template. At this time, the amplification of a DNA fragment can be confirmed by an 

w ordinary method with electrophoresis. For the DNA fragment amplified, its restriction endonuclease map is constructed 
or its nucleotide sequence is determined by an ordinary method, so that the present gene or the present gene fragment 
can be identified. As the gene library used herein, for example, a cDNA library or a genomic cDNA library of a desired 
plant can be used. For the plant gene library, a commercially available library derived from plant can be used as such; 
or a library prepared according to an ordinary library preparation method, for example, as described in "Molecular Clon- 

75 ing: A Laboratory Manual 2nd edition" (1989), Cold Spring Harbor laboratory Press or "Current Protocols in Molecular 
Biology" (1987), John Wiley & Sons, inc., ISBN 0-471 -50338-X, can also be used. As the genomic DNA or cDNA used 
in the present amplification method, for example, cDNA or genomic cDNA prepared from a desired plant can be used. 

More specifically, for example, a primer designed on the amino acid sequence of SEQ ID NO:1 is used for the 
present amplification method with cDNA derived from Japanese artichoke, which is a lamiaceous plant, as a template, 

20 so that a raff inose synthetase gene fragment having the nucleotide sequence of SEQ ID NO:6 can be amplified. Fur- 
thermore, for example, a primer designed on the amino acid sequence of SEQ ID NO:1 is used for the present amplifi- 
cation method with cDNA derived from corn, which is a gramineous plant, as a template, so that a raffinose synthetase 
gene fragment having the nucleotide sequence of SEQ ID NO:8 can be amplified. 


25 

(List 4) 



1-F 

32mer 



TTIAAIGTITGGTGGACIACICAITGGGTIGG 


2-F 

41mer 

30 


ATIATIGAIAAITTIGGITGGTGIACITGGGAIGCITTITA 


2-RV 

41mer 



TAIAAIGCITCCCAIGTICACCAICCIAAITTITCIATIAT 


3-F 

44mer 



GGIGGITGICCICCIGGITTIGTIATIATIGAIGAIGGITGGCA 

35 

3-RV 

44mer 



TGCCAICCITCITCIATIATIACIAAICCIGGIGGICAICCICC 


4-F 

32mer 



AAIAAICAITTIAAIGGIAAIGGIGTIATIGC 


4-RV 

32mer 

40 


GCIATIACICCITTICCITTIAAITGITTITT 


5-F 

38mer 



TGGATGK3GIAAITTIATICAICCIGAITGGGAIATGTT 


5-RV 

38mer 



AACATITCCCAITCIGGITGIATIAAITTICCCATCCA 

45 

6-RV 

27mer 



CATITTIACIA(AG)ICCIATIGGIGCIAA 


The present amplification method can also be utilized for the analysis of a plant gene. More specifically, for exam- 
ple, plant genomic DNA prepared from different vaneties of a specified plant species is used as a template for the 

so present amplification method to amplify a DNA fragment. The DNA fragment amplified is mixed with a solution of for- 
maldehyde, which is denatured by heating at 85° C for 5 minutes, followed by rapid cooling on ice. This sample is sub- 
jected to electrophoresis, for example, on a 6% polyacrylamide gel containing 0% or 10% glycerol. In this 
electrophoresis, a commercially available electrophoresis apparatus for SSCP (single strand conformation polymor- 
phism) can be used, and electrophoresis is carried out, while the gel is kept at a constant temperature, e.g., 5°C, 25°C 

55 or 37°C. From the electrophoresed gel, a DNA fragment is detected, for example, by a method such as silver staining 
method with commercially available reagents. 

From th diff rences of behavior betwe n the varieties in the lectrophoresis of the DNA fragment detected, a 
mutation in the raff inos synthetase gen is detected, and an analysis is carried out for differences caused by the muta- 
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tion in phenalypic characteristics accompanied with the expressi n of raffinose family oligosaccharides. This method 
can be carried out according to the SSCP method, for example, as described in "Plant PCR Experiment Protocols" 
complied under the supervision of Ko Shimamoto and Takuji Sasaki, published by Shujun-sha, Tokyo (1995), ISBN 4- 
87962-144-7, pp. 141-146. 

The present detection method or the present amplification method as described above can also be used for iden- 
tifying a raffinose synthetase gene or a gene fragment having a partial nucleotide sequence thereof and then isolating 
and purifying the identified gene or gene fragment thereof to obtain the present gene (hereinafter referred to simply as 
the present gene acquisition method). 

The present gene or the present gene fragment can be obtained, for example, by detecting a probe consisting of 
the present gene fragment hybridized to DNA in the organism-derived gene library by the present detection method as 
described above to identify DNA having a nucleotide sequence homologous with the probe used; purifying a clone car- 
rying the DNA; and isolating and purifying plasmid or phage DNA from the clone. When the DNA thus obtained is a 
gene fragment having a partial nucleotide sequence of the raffinose synthetase gene, further screening of the gene 
library by the present gene detection method using the DNA as a probe gives the present gene in full length. 

The present gene or the present gene fragment can be identified, for example, by effecting polymerase chain reac- 
tion using a primer having the nucleotide sequence of the present gene fragment to amplify a DNA fragment from the 
organism-derived DNA by the present amplification method as described above; and then constructing a restriction 
endonuclease map or determining a nucleotide sequence for the amplified DNA fragment. Based on the nucleotide 
sequence of the gene fragment obtained, an antisense primer is synthesized for the analysis of S'-terminal sequences, 
and a sense primer is synthesized for the analysis of 3-terminal sequences. The nucleotide sequence of the present 
gene in full length can be determined by the RACE method using these primers and a commercially available kit, e.g., 
Marathon Kit of Clontech. The present gene in full length can be obtained by synthesizing new primers based on both 
terminal sequences in the nucleotide sequence thus determined and effecting polymerase chain reaction again. 

The present gene acquisition method as described above makes it possible to obtain raffinose synthetase genes 
as the present gene from various organisms. For example, a gene coding for a raffinose synthetase having an amino 
acid sequence that has about 50% or higher homology, in the region corresponding to the length of 400 or more amino 
acids, with the amino acid sequence of SEQ ID NO:1 , and capable of producing raffinose by combining a D-galactosyl 
group through an a(1-»6) bond with a hydroxyl group attached to the carbon atom at position 6 of a D-glucose residue 
in a sucrose molecule. More specifically, for example, a raffinose synthetase gene having the nucleotide sequence of 
SEQ ID NO:4 can be obtained by amplifying and identifying a DNA fragment containing a gene fragment having a par- 
tial nucleotide sequence of the raffinose synthetase gene by the present amplification method using primers designed 
from the amino acid sequence of SEQ ID NO:1 and soybean cDNA as a template; isolating and purifying the identified 
DNA fragment, followed by the above procedures to obtain a full-length gene containing the DNA fragment. 

A chimera gene comprising the present gene and a promoter linked thereto (hereinafter referred to simply as the 
present chimera gene) can be constructed. 

The promoter to be used is not particularly limited, so long as it is functionable in a host organism to be trans- 
formed. The promoter may include, for example, synthetic promoters functionable in Escherichia co//, such as E. coli 
lactose operon promoter, E. coli tryptophan operon promoter and tac promoter; yeast alcohol dehydrogenase gene 
(ADH) promoter, adenovirus major late (Ad.ML) promoter, SV40 early promoter, and baculovirus promoter. 

When the host organism is a plant or a cell thereof, the promoter may include, for example, T-DNA derived consti- 
tutive promoters such as nopaline synthetase gene (NOS) promoter and octopine synthase gene (OCS) promoter; 
plant virus-derived promoters such as cauliflower mosaic virus (CaMV) derived 19S and 35S promoter; derived promot- 
ers such as phenylalanine ammonia-lyase (PAL) gene promoter, chalcone synthetase (CHS) gene promoter and patho- 
genesis-related protein (PR) gene promoter. Furthermore, vector pSUM-GY1 (see JP-A 06-189777/1994) can also be 
used, which has a promoter giving specific expression in a specified plant tissue, e.g., a promoter of soybean-derived 
seed storage protein glycinin gene. The use of a chimera gene constructed so as to have such a promoter makes it pos- 
sible to increase or decrease the content of raffinose family oligosaccharides in a specified tissue of a plant. 

The present chimera gene is then introduced into a host organism according to an ordinary gene engineering 
method to give a transformant. If necessary, the present chimera gene may be used in the form of a plasmid, depending 
upon the transformation method for introducing the gene into the host organism. Furthermore, the present chimera 
gene may contain a terminator. In this case, it is generally preferred that the chimera gene is constructed so as to have 
a terminator downstream the raffinose synthetase gene. The terminator to be used is not particularly limited, so long as 
it is functionable in a host organism to be transformed. For example, when the host organism is a plant or a cell thereof, 
the terminator may include, for example. T-DNA derived constitutive terminators such as nopaline synthetase gene 
(NOS) terminator; and plant derived terminators such as terminators of allium virus GV1 or GV2. 

If necessary, the present gen may be used in the form of a plasmid. For example, when the host organism is a 
microorganism, the plasmid constructed is introduced into the microorganism by an ordinary means, for exampl , as 
described in "Molecular Cloning: A Laboratory Manual 2nd edition" (1989), Cold Spring Harbor laboratory Press or 
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"Current Protocols in Molecular Biology" (1987), John Wiley & Sons, Inc., ISBN 0-471 -50338-X. The microorganism 
thus transformed is selected with a marker such as antibiotic resistanc or auxotrophy. When the host organism is a 
plant, the plasmid constructed is introduced into a plant cell by an ordinary means such as infection with Agrobacterium 
(see JP-B 2-58917/1990 and JP-A 60-70080/1985), electroporation into protoplasts (see JP-A 60-251887/1985 and 
JP-B 5-68575/1 993) or particle gun method (see JP-A 5-50831 6/1 993 and JP-A 63-258525/1 988). The plant cell trans- 
formed by the introduction of a plasmid is selected with an antibiotic such as kanamycin or hygromycin. From the plant 
cell thus transformed, a transformed plant can be regenerated by an ordinary plant cell cultivation method, for example, 
as described in "Plant Gene Manipulation Manual (How to Produce Transgenic Plants)" written by Uchimiya, 1990, 
Kodan-sha Scientific (ISBN 4-06-153513-7), pp. 27-55. Furthermore, the collection of seeds from the transformed plant 
also makes it possible to prolify the transformed plant. In addition, crossing between the transformed plant obtained and 
the non-transformed plant makes it possible to produce progenic plants with the character of the transformed plant. 

The content of raff inose family oligosaccharides can be changed by introducing the present gene into a host organ- 
ism or a cell thereof, and modifying the metabolism in the host organism or the cell thereof. As such a method, for exam- 
ple, there can be used a method for metabolic modification to increase the amount of raffinose family oligosaccharides 
in a host organism or a cell thereof by constructing the present chimera gene comprising the present gene and a pro- 
moter linked thereto, in which case the present gene is linked to the promoter in an original direction suitable for tran- 
scription, translation, and expression as a protein, and then introducing the present chimera gene into the host 
organism or the cell thereof; or a method for metabolic modification to decrease the amount of raffinose family oligosac- 
charides in a host organism or a cell thereof by constructing the present chimera gene comprising the present gene and 
a promoter linked thereto, in which case the present gene is linked to a promoter in a reverse direction unsuitable for 
translation and expression as a protein, and then introducing the present chimera gene into the host organism or the 
ceil thereof. 

The term "raffinose synthetase protein" as used herein refers to a protein encoded in the present gene (hereinafter 
referred to simply to the present protein). For example, it may include an enzyme protein having the amino acid 
sequence of SEQ ID NO:1 or SEQ ID NO: 3, or having an amino acid sequence derived by deletion, replacement, mod- 
ification or addition of one or several amino acids in the amino acid sequence of SEQ ID NO:1 or SEQ ID NO:3; and 
capable of producing raffinose by combining a D-galactosyl group through an a(1^6) bond with a hydroxyl group 
attached to the carbon atom at position 6 of a D-glucose residue in a sucrose molecule. 

Specific examples of the present protein are an enzyme protein having the amino acid sequence of SEQ ID NO:1 
(799 amino acids; molecular weight, 89 kDa) and an enzyme protein having the amino acid of SEQ ID NO:3 (781 amino 
acids; molecular weight, 87 kDa). 

The present protein, although it can be prepared, for example, from leguminous plants such as broad bean ( Vicia 
faba), by an ordinary biochemical method such as (NHJgSC^ precipitation, ion exchange column, hydrophobic column, 
hydroxyapatite column or gel filtration column, can also be prepared from the host organism transformed with the 
present plasmid, or a cell thereof. More specifically; for example, using GST Gene Fusion Vectors Kit of Pharmacia, the 
present gene is inserted into an expression vector plasmid attached to the Kit. The resulting vector plasmid is intro- 
duced into a microorganism such as E. colt according to an ordinary gene engineering method. A culture of the trans- 
formant obtained is grown on a medium with the addition of IPTG (isopropylthio-p-D-galactoside), so that the present 
protein can be expressed and derived as a fused protein in the cufture. The fused protein expressed and induced can 
be isolated and purified by an ordinary method such as disruption of bacterial cells, column operation or SDS-PAGE 
electrophoresis. The digestion of the fused protein with a protease such as thrombin or Wood coagulation factor Xa 
gives the present protein. This may preferably be made, for example, according to the method described in "Current 
Protocols In Protein Science" (1995), John Wiley & Sons, Inc. ISBN 0-471-11184-8. The activity of the present protein 
can be measured, for example, by the method described in L. Lehle and W. Tanner, Eur. J. Biochem., 38, 103-110 
(1973). 

An anti-raffinose synthetase antibody capable of binding to a raffinose synthetase protein (hereinafter referred to 
simply as the present antibody) can be produced by an ordinary immunological method using the present protein pre- 
pared above, as an antigen. More specifically, the present antibody can be produced, for example, according to the 
method described in Ed Harlow and David Lane, "Antibodies: A Laboratory Manual" (1988), Cold Spring Harbor Labo- 
ratory Press, ISBN 0-87969-314-2. 

The present protein can be detected by treating test proteins with the present antibody and detecting a protein hav- 
ing the present antibody bound specifically thereto. Such a detection method can be carried out according to an immu- 
nological technique such as Western Wot method or enzyme-linked immunosorbent assay (ELISA), for example, as 
described in Ed Harlow and David Lane, "Antibodies: A Laboratory Manual" (1988), Cold Spring Harbor Laboratory 
Press. 

Th Western Wot method is carried out, for example, as follows: Proteins are extracted from a plant, for example, 
according to the method described in Methods in Enzymology, volume 182, "Guide to Protein Purification," pp. 174-193, 
ISBN 9-12-1 82083-1 . The composition of an extraction buffer can suitably be changed depending upon the plant tissue 
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used. The proteins extracted are electrophoresed according to an ordinary SDS-PAGE method. The proteins lectro- 
phoresed in the gel ar transferred to a membran by Western blotting with an ordinary electrical method. Mor specif- 
ically, for example, the gel is immersed in a transfer buffer (25 mM Tris, 192 mM glycine, 20% methanol) for 10 minutes, 
and then placed onto a PVDF membrane cut into the same size as that of the gel. The gel together with the membrane 
is set in a commercially available transfer apparatus of the semi-dry type. Blotting is carried out at a constant current of 
0.8 to 2 mA/cm 2 for 45 minutes to 1 hour. The proteins transferred to the membrane can be detected immunologically 
with a kit for Western blot detection using a primary antibody, and a secondary antibody or protein A, which has been 
labeled with alkaline phosphatase or horseradish peroxidase. At this time, the present protein on the membrane call be 
detected by the use of the present antibody as a primary antibody. 

In the ELISA method, for example, the property of proteins binding to the surface of a 96-well ELfSA plate made of 
a resin is utilized in principle for the immunological detection of an antigen finally bound to the surface of the ELISA 
plate. The test proteins are added as a solution and bound to an ELISA plate, followed by blocking, for example, by the 
addition of PBS containing a protein such as 5% bovine serum albumin. Thereafter, the well is washed with PBS, to 
which a solution containing the present antibody is added to effect the reaction. After the well is washed, a solution con- 
taining a secondary antibody labeled with alkaline phosphatase or horseradish peroxidase is further added to the well, 
followed by washing. Finally, a substrate solution for detection is added to the well, and the color development of the 
substrate is detected with an ELISA reader. 

In another method, the present antibody is added and bound to an ELISA plate, followed by blocking, for example, 
by the addition of PBS containing a protein such as 5% bovine serum albumin. The test proteins are then added as a 
solution, and an antigen contained in the test proteins is bound to the present antibody that has been bound to the plate, 
followed by washing, and the present antibody is further added to the well. The present antibody used at this time is 
preferably one prepared from an animal species different from that used for the preparation of the present antibody 
used first. A solution containing a secondary antibody labeled with alkaline phosphatase or horseradish peroxidase is 
then added to the well, followed by washing. The secondary antibody used at this time must have the property of bind- 
ing to the present antibody added later. Finally, a substrate solution for detection is added, and the color development 
is detected with an ELISA reader. 

Examples 

The present invention will be further illustrated by the following examples; however, the present invention is not lim- 
ited to these example in any way whatsoever. 

Example 1 (Purification of Galactinol) 

About 250 ml of sugar beet blackstrap molasses was five-told diluted with methanol. The dilution was centrrfuged 
at 21 ,400 x g for 15 minutes at room temperature to remove insoluble matter. The supernatant obtained was transferred 
into a 2-liter Erlenmeyer flask, to which isopropanol at a half volume was added portionwise with stirring. The flask was 
left at room temperature for a while until the resulting precipitate adhered to the wall of the flask. The supernatant was 
then discarded by decantation. To the precipitate was added 500 ml of ethanol, and the mixture was washed by stirring 
with a rotary shaker. The washing was further repeated several times. The washed precipitate was scraped off from the 
wall of the flask, followed by air drying on a filter paper. The air-dried precipitate (dry powder) was dissolved in purified 
water to about 40% (w)v). To this solution was added AG501-X8(D) of BioRad, followed by stirring. This operation was 
repeated until the color of the solution became almost unobserved. The resulting solution was treated with a Sep-Pak 
QMA column of Millipore, and further pretreated with Sep-Pak CM, Sep-Pak C18 and Sep-Pak Silica columns of Milli- 
pore. The resulting solution was loaded at a volume of 5 ml onto a column of Wako-gel LP40C18 (Wako Pure Chemical 
Industries, 2.6 cm x 85 cm), and eluted with purified water. The sugar content of the eluate was measured with a port- 
able sugar refractometer, and the sugar composition was analyzed by high performance liquid chromatography (HPLC) 
with a Sugar-pak Na (7.8 mm x 300 mm) column of Millipore. The detection of sugars was carried out with model 410 
Differential Refractometer of Waters. The eluate containing galactinol was lyophilized, and the resulting lyophilized pow- 
der was dissolved in 5 ml of purified water. The solution was loaded onto a column of TOYOPEARL HW40(S) (Toso, 2.6 
cm x 90 cm), and eluted with purified water. The eluate was analyzed in the same manner as described above, so that 
purified galactinol was obtained. 

The galactinol obtained was kept at 25°C for 40 minutes in the reaction mixture that came to contain 80 mM phos- 
phate buffer (pH 6.5), 2 mg/ml galactinol, and 8.3 U a-galactosidase (Boehringer Mannheim, E. coii overproducer 
662038). The reaction mixture was extracted with chloroform, and the water layer was analyzed by HPLC. The resulting 
galactinol was confirmed to be hydrolyzed into galactose and myo-inositol. 
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Example 2 (Measurement of Raffinose Synthase Activity) 

The raffinose synthetas activity was measured under the following conditions according to the description of L 
Lehle and W. Tanner, Eur. J. Biochem., 38, 1 03-1 1 0 (1 973). 

First, 2 \i\ of a sample to be used in the measurement of activity was added to 18 \i\ of the reaction mixture that 
came to contain 100 mM Tris-HCI (pH 7.4), 5 mM DTT (dithiothreitol), 0.01% BSA, 200 *iM sucrose, 5 mM galactinol, 
740 KBq/ml (31 .7 uM) [ 14 C] sucrose, and the reaction mixture was kept at 37°C for 3 to 20 hours. After the reaction, 30 
ill of ethanol was added to the reaction mixture, followed by stirring and centrifugation at 15,000 rpm for 5 minutes. The 
supernatant was spotted at a volume of 5 ul on an HPTLC plate of cellulose for thin layer chromatography (Merck, 10 
cm x 20 cm), and developed with n-butanol : pyridine : water : acetic acid = 60 : 40 : 30 : 3. The developed plate was 
dried and then quantitatively analyzed with an imaging analyzer (Fuji Photographic Film, FUJIX Bio Imaging Analyzer 
BAS-2000II) for the determination of [ 14 C] raffinose produced. 

Example 3 (Purification of Raffinose Synthase) 

The purification of raffinose synthetase from broad bean was carried out as follows: For each purified protein solu- 
tion, proteins present in the protein solution were analyzed by SDS-PAGE (Daiichi Kagaku Yakuhin), and the enzyme 
activity thereof was measured according to the method described in Example 2. 

First, 300 g of immature seeds of broad bean (Nintoku Issun) stored at -80°C was thawed and then peeled. The 
peeled seeds were put in 600 ml of 100 mM Tris-HCI (pH 7.4), 5 mM DTT (dithiothreitol), 1 mM EDTA, 1 mM PMSF 
(phenylmethylsutfonyl fluoride) and 1 mM benzamide, and ground on ice with a mortar. The ground material was cen- 
trifuged at 21,400 x g for 50 minutes at 4°C. To the resulting supernatant was added 10% polyethylene imine (pH 8.0) 
at a 1720 volume. The mixture was stirred at 4°C for 15 minutes, and centrifuged at 15,700 x g for 20 minutes at 4°C. 
To the resulting supernatant was added 196 g/l of (NH 4 ) 2 S0 4 with stirring. The mixture was stirred in ice for 30 minutes, 
and centrifuged at 15,700 x g for 20 minutes at 4°C. To the resulting supernatant was further added 142 g/l of 
(NH 4 ) 2 S0 4 with stirring. After the stirring in ice for 30 minutes, the mixture was centrifuged at 15,700 x g for 20 minutes 
at 4°C. The resulting precipitate was dissolved in 50 ml of 100 mM Tris-HCI (pH 7.4) and 5 mM DTT (dithiothreitol). and 
the solution wasdialyzed against 20 mM Tris-HCI (pH 7.4), 1 mM DTT (dithiothreitol) and 1 mM EDTA at 4°C overnight. 
After the dialysis, the suspension was centrifuged at 70,000 x g for 60 minutes at 4°C. To the resulting supernatant was 
added 1 mM benzamidine • HCI, 5 mM e-amino-n-caproic acid, 1 ug/ml antipain, 1 jig/ml leupeptin and 10 mM EGTA, 
and 2 M KCI was further added portionwise at a 1/40 volume. The mixture was loaded onto a column of DEAE-Sepha- 
cel (Pharmacia, 2.5 cm x 21 .5 cm) equilibrated with 0.05 M KCI, 20 mM Tris-HCI (pH 7.4), 1 mM DTT (dithiothreitol) and 
1 mM EDTA, and the adsorbed proteins were eluted with a gradient of 0.05 to 0.5 M KCI. The purification steps up to 
this stage were repeated three times, and fractions having raffinose synthase activity were combined and then purified 
as follows: 

To the eluted fraction having raffinose synthetase activity was added portionwise saturated (NH 4 ) 2 S0 4 at a 1/4 vol- 
ume. The solution was loaded onto a column of Phenyl-Sepharose (Pharmacia, 2.5 cm x 10.2 cm) equilibrated with 
20% saturated (NH^SO* 20 mM Tris-HCI (pH 7.4), 1 mM DTT (dithiothreitol) and 1 mM EDTA, and the adsorbed pro- 
teins were eluted with a gradient of 20% to 0% (NH^SO^ The resulting active fraction was diluted by the addition of 
0.01 M potassium phosphate buffer (pH 7.5) at a 2-fold volume. The diluted solution was loaded onto a column of 
Econo-Pac 10DG (BioRad, 5 ml) previously equilibrated with 0.01 M potassium phosphate buffer (pH 7.5) and 2 mM 
DTT (dithiothreitol), and the adsorbed proteins were eluted with a gradient of 0.01 to 0.5 M potassium phosphate buffer 
(pH 7.5) and 2 mM DTT (dithiothreitol). The active fraction obtained at this stage was found to have been purified up to 
6500-fold or higher specific activity. Part of the resulting purified protein solution having raffinose synthetase activity 
was loaded onto a column of Superdex 200 (Pharmacia, 1.6 cm x 60 cm) equilibrated with 0.2 M KCI, 20 mM Tris-HCI 
(pH 7.4), 1 mM DTT (dithiothreitol) and 1 mM EDTA. The purified proteins thus separated were subjected to SDS- 
PAGE, and the raffinose synthetase activity was measured. A protein band having raffinose synthetase activity was 
identified as having a molecular weight of about 90 kDa on the SDS-PAGE. 

Example 4 (Analysis of Partial Amino Acid Sequence of Raffinose Synthase) 

To about 1 ml of the purified protein solution, which had been purified with a column of Econo-Pac 10DG (BioRad, 
5 ml) in Example 3. was added 1 00% TCA at a 1/9 volume, and the mixture was left on ice for 30 minutes. After centrif- 
ugation at 10,000 x g for 15 minutes, the resulting precipitate was suspended in 500 *il of cold acetone (-20°C), followed 
by further centrifugation. This acetone washing was repeated, and the collected precipitate was dried and then dis- 
solved in 200 pJ of SDS-sample buff r, followed by SDS-PAGE. CBB staining was effected for the electrophoresed gel, 
from which the band of a raffinose synthetase protein was cut out. 

To the gel thus taken was added 1 ml of 50% acetonitrile and 0.2 M ammonium carbonate (pH 8.9), and washing 
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was continued with stirring at room temperature for 20 minutes. The gel was washed once again in the same mann r, 
and dried under reduced pressure to an extent giving a volum reduction. To this gel was 1 ml of 0.02% Tween-20 and 
0.2 M ammonium carbonate (pH 8.9), and the mixture was stirred at room temperature for 15 minutes. After removal of 
th solution, 400 id of 8 M urea and 0.4 M NH4HCO3 was added, to which 40 uJ of 45 mM DTT (dithiothreitol) was fur- 

5 ther added, and the mixture was left at 50°C for 20 minutes. After complete return to room temperature, 4 p\ of 1 M 
iodoacetic acid was added, and the mixture was stirred in the dark at room temperature for 20 minutes. After removal 
of the solution, 1 ml of purified water was added, and the mixture was stirred at room temperature for 5 minutes, fol- 
lowed by washing. After further two washings, 1 ml of 50% acetonitrile and 0.2 M ammonium carbonate (pH 8.9) was 
added, and the mixture was stirred at room temperature for 15 minutes. The same treatment was repeated once again, 

to after which the solution was removed, and the gel was dried under reduced pressure to an extent giving a volume 
reduction. 

To this gel was added a solution of Achromobacter Protease I (Takara, Residue-specific Protease Kit) at a volume 
of 100 ixl. Further added was 0.02% Tween-20 and 0.2 M ammonium carbonate (pH 8.9) to an extent that the gel was 
not exposed from the surface of the solution, and the mixture was left at 37°C for 42 hours. Further added was 500 jil 

15 of 0.09% TFA and 70% acetonitrile, and the mixture was stirred at room temperature for 30 minutes. The resulting mix- 
ture as contained in a sample tube was floated in an ultrasonic bath, followed by ultrasonic treatment (BRANSON, 60 
W output power) for 5 minutes. The tube and contents thus treated were centrifuged, and the resulting extract was col- 
lected in another silicone-coated sample tube. On the other hand, 500 \i\ of 0.09% TFA and 70% acetonitrile was added 
again to the precipitate, followed by repeated extraction in the same manner as described above. The resulting extracts 

20 were combined and then concentrated under reduced pressure to an extent giving a solution remained at a volume of 
200 to 300 To the concentrate was added 25 jd of 8 M urea and 0.4 M NH4HCO3. and the mixture was concentrated 
to an extent giving a solution remained at a volume of 100 fd or lower. The concentrate was brought to about 1 00 yd with 
purified water, and the mixture was filtered through a filter of Urtrafree C3 GV (Millipore). The filtrate obtained was then 
subjected to elution through a column of Aquapore BU-300 C-4 (2.1 mm x 300 mm) by a gradient of 0.1% TFA/2.1% to 

25 68.6% acetonitrile. Absorbance at 215 nm was monitored to collect a fraction at a peak thereof. The sample collected 
was evaporated under reduced pressure to complete dryness, and then analyzed with a Protein Sequencer 473 A of ABI 
to determine a partial amino acid sequence of a raffinose synthetase. 

Example 5 (Preparation of cDNA) 

30 

About 2 g of immature seeds of broad bean (Nintoku Issun) was frozen in liquid nitrogen and then ground with a 
mortar, to which 20 ml of Isogen (Nippon Gene) was added, and the mixture was further thoroughly ground. The ground 
material was transferred into a cerrtrHugation tube, to which 4 ml of chloroform was added, and the mixture was stirred 
with a vortex mixer and then centrifuged at 6,500 x g for 10 minutes at 4°C. The water layer was collected, to which 10 

35 ml of isopropanol was added, and the mixture was stirred and then centrifuged at 6,500 x g for 10 minutes at 4°C. The 
resulting precipitate was washed with 1 0 ml of 70% ethanol and then dissolved in 1 ml of elution buffer (1 0 mM Tris-HCI 
(pH 7.5), 1 mM EDTA, 0.1% SDS). The solution was left at 60°C for 10 minutes and then centrifuged at 10,000 x g for 
1 minute to remove insoluble matter. To the resulting supernatant was added an equivalent volume of Oligotex-dT30 
(Takara), and the mixture was stirred and then left at 65°C for 5 minutes. The mixture was placed on ice and then left 

40 for 3 minutes, to which 200 jd of 5 M NaCI was added, and the mixture was left at 37°C for 1 0 minutes. The mixture was 
then centrifuged at 10,000 x g at 4°C for 3 minutes. The precipitate was collected and then suspended in 1 ml of TE 
buffer, and the suspension was left at 65°C for 5 minutes, which was placed on ice and then left for 3 minutes, followed 
by centrifugation at 10,000 x g for 3 minutes at 4°C to remove the precipitate. 

To the resulting supernatant were added 100 id of 3 M sodium acetate and 2 ml of ethanol, and RNA was ethanol 

45 precipitated and collected. The collected RNA was washed twice with 70% ethanol and then dissolved in 20 |d of ster- 
ilized water, which was used for the subsequent cDNA synthesis. The amount of RNA obtained was determined by the 
measurement of absorbance at 260 nm. 

For the cDNA synthesis, First Strand Synthesis Kit for RT-PCR (Amercham) and cDNA Synthesis Kit (Takara) were 
used, and all operations were made according to the protocol. 

50 

Example 6 (Nucleotide sequence Analysis of Raffinose Synthase Gene from cDNA) 

Based on the amino acid sequence obtained in Example 4, mixed synthetic DNA primers having the nucleotide 
sequences shown in list 5 below were synthesized. The PCR method was carried out with Gene Amp PCR Systems 
55 2400 and DNA Thermal Cycler Model 480 of Perkin-Elmer using Advantage KlenTaq cDNA Kit of Clontech. The 
polymerase chain reaction was effected with the above primers at 94°C for 1 minute, at 50°C for 3 minutes, and at 72°C 
for 3 minutes, and this reaction was repeated forty times. As a result, the combinations of primers 8.2 and 13.3RV, prim- 
ers 13.4 and 10.3RV, and primers 7.4 and 10.3RV. having the nucleotide sequences shown in list 5 below, gave an 
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amplification of 1.2 Kb, 0.5 kb, and 1.2 kb bands, respectively. Thes amplified DNA fragments were cloned with a TA 
cloning kit (Invitrogen), followed by sequence reaction using ABI PRISM Dye Terminator Cycle Sequencing Ready 
Reaction Kit of Perkin-Elm r and nucleotide sequence analysis with a 373S DNA sequencer of ABI. As a result, these 
DNA fragments were found to have a nucleotide sequence extending from base 813 to base 1915, base 1936 to base 
2413, and base 1226 to base 2413, respectively, in the nucleotide sequence of SEQ ID NO:2. Based on these nucle- 
otide sequences, synthetic DNA primers having nucleotide sequences shown in list 6 below were prepared, and the 
nucleotide sequences in both terminal regions of cDNA were analyzed with Marathon cDNA Amplification Kit of Clon- 
tech. As a result, the nucleotide sequence of SEQ ID NO:2 was finally determined. 


(ListS) 


#8.2 

26mer 


AA(AG) AC(ATGC) GC(ATGC) CC(ATGC) AG(TC) AT(TCA) AT(TCA) GAC AA 

#13.4 

20mer 


AA(AG) ATfTCA) TGG AA(TC) CT(ATGC) AAC AA 

#7.4 

24mer 


AA(AG) GC(ATGC) AG(AG) GT(ATGC) GT(ATGC) GT(ATGC) CC(ATGC) AAG 

fflo.oHV 

21mer 


(TC)TT (AG)TT (ATGC)AG (AG)TT CCA (AGT) AT TTT 

#10.3RV 

21mer 


(TC)TT (AG)TC 0"C)TC (AG)TA (ATGC)AG (AG)AA TTT 

(List 6) 


RS-2RV 

30mer 


GGCTGAGGTTCGGTTCATTCCTGAATCATC 

RS-7 

30mer 


CCAAATGGTACATATTGGCTCCAAGGTTGT 

RS-8 

30mer 


AAGAGTGTATCTGAATTTTCACGCGCGGTG 

RS-9 

30mer 


TGGTGCAATGGGAAAACTCCAATGAGCACC 

RS-10 

30mer 


ATGAAGTGTTCTGATAGATTGAAAGTTTCG 

RS-11 

30mer 


CAGTCTCTGGAGTTTGATGATAATGCAAGT 

Example 7 (Cloning of Raffinose Synthetase Gene from Broad Bean cDNA) 


The primers designed from the amino acid sequence of SEQ ID NO:1, i.e., primers having nucleotide sequences 
shown in list 7 below, were synthesized. Using these primers and cDNA obtained in Example 5 as a template, a DNA 
fragment of the open reading frame region was amplified by PCR under the conditions described in Example 6. The 
amplified DNA fragment was digested with the restriction endonucleases Bam HI and Xba I whose recognition 
sequences were contained in the primers used. Using Ligation Kit (Takara), the DNA fragment thus digested was 
cloned in the plasmid pBluescriptll KS- (Stratagene) previously digested with Bam HI and Xba I. The nucleotide 
sequence of the cloned DNA fragment was confirmed with ABI PRISM Dye Terminator Cycle Sequencing Ready Reac- 
tion Kit of Perkin-Elmer. In the done thus obtained, it was found that the base at position 1591 in the nucleotide 
sequence of SEQ ID NO:2 had been changed from thymine (T) to cytosine (C). This was, however, a nonsense muta- 
tion without a change of the amino acid; therefore, this clone was designated pBluescriptKS-RS, and used in the sub- 
sequent experiment. 

(List 7) 

RS-N 41mer 

CGCGGATCCACCATGGCACCACCAAGCATAACCAAAACTGC 
RS-C 37mer 

TGCTCTAGATTATCAAAATAAAAACTGGACCAAAGAC 
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Example 8 (Expression f Broad Bean Raffinose Synthetase Gene in E. coli) 

The plasmid pBluescriptKS-RS having the broad bean raffinose synthetase gene obtained in Example 7 was 
digested with Bam HI and Not I, and cloned in the plasmid pGEX-4T3 (Pharmacia) digested with Bam HI and Not I to 
give the plasmid pGEX-RS as shown in Figure 1 . 

The plasmid pBluescriptKS-RS was digested with Nco I and Xba I, and cloned in the plasmid pTrc99A (Pharmacia) 
digested with Nco I and Xba I to give the plasmid pTrc-RS as shown in Figure 1 . 

These plasmids were introduced into £. coli strain HB1 01 , and the resulting translbrmants were used for the con- 
firmation of raffinose synthetase expression. Overnight cultures of the translbrmants were inoculated at a volume of 1 
ml each into 100 ml of LB medium and incubated at 37°C for about 3 hours, followed by the addition of IPTG (isopro- 
pylthio-p-D-galactoside) to a final concentration of 1 mM and further incubation for 5 hours. The cultures were centri- 
fuged at 21,400 x g for 10 minutes, and the bacterial cells were collected. The collected bacterial cells were stored at - 
80°C. To the frozen bacterial cells was added a 10-fold volume of 100 mM Tris-HCI (pH 7.4), 1 mM EDTA, 5 mM DTT 
(drthiothrettol), 1 mM PMSF (phenylmethylsuHonyi fluoride) and 1 mM benzamide, and the bacterial cells were thawed 
and suspended. These suspensions were treated with an ultrasonic disrupter (Branson) to effect the disruption of the 
bacterial cells. The disrupted cell mixtures obtained were centrifuged at 16,000 x g for 10 minutes, and soluble protein 
solutions were collected. 

The protein solutions thus obtained were used at a volume of 4 uJ each for the measurement of raffinose syn- 
thetase activity according to the method described above. The reaction was effected at 37°C for 64 hours. As a control, 
£ coli strain HB101 that had been transformed with one of the vectors, pGEX-4T3, was used. The results are shown 
in Table 1. The synthesis of raffinose was detected in the samples from the transfer marrts HB101 (pGEX-RS) and 
HB101 (pTrc-RS). 


TABLE 1 


Transformant 

Amount of raffinose pro- 
duced (pmol) 

HB101 (DGEX4T-3) 

0.56 

HB101 (pGEX-RS) 

10.50 

HB101 (pTrc-RS) 

11.10 


Example 9 (Cloning of Raffinose Synthetase Gene from Soybean cDNA) 


In the same manner as described in Example 5, cDNA was obtained from immature seeds of soybean (Glycine 
max) Williams 82. Using this cDNA as a template and primers designed from the amino acid sequence of SEQ ID NO: 1 . 
i.e., primers having nucleotide sequences shown in list 8 below, a DNA fragment was amplified by PCR under the con- 
ditions described in Example 6. The DNA fragment thus amplified by PCR was cloned with a TA cloning kit (Invitrogen), 
followed by sequence reaction using ABI PRISM Dye Terminator Cycle Sequencing Ready Reaction Kit of PerWn-Elmer 
and nucleotide sequence analysis with a 373S DNA sequencer of ABI. Based on this sequence, primers having nucle- 
otide sequences shown in list 9 below were synthesized. The synthesis of cDNA was carried out with Marathon Kit of 
Clontech using mRNA obtained in the same manner as described in Example 5 from leaves of soybean Williams 82. 
The cDNA obtained was ligated to an adaptor contained in this kit with ligase. This operation was made according to 
the protocol attached. Using the adaptor-ligated cDNA thus prepared, polymerase chain reaction was effected with the 
primers shown in list 9 below. The nucleotide sequences in both terminal regions of the gene were analyzed according 
to the protocol attached to the Marathon Kit of Clontech. As a result, the nucleotide sequence of SEQ ID NO:4 was 
determined. 


(List 8) 

1- F primer 35mer 

CGATTIAAIGTITGGTGGACIACICAITGGGTIGG 

2- RV primer 45mer 

GGCCTAIAAIGCITCCCAIGTICACCAICCIAAITTITCIATIAT 

5- F primer 41 mer 

CGATGGATGGGIAAITTIATICAICCIGAITGGGAIATGTT 

6- RV primer 32mer 
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GGCCACAnrnACIA(AG)ICCIATIGGJGCIAA 


30 


(List 9) 


5 SN-1 30mer 

CACGAACTGGGGCACGAGACACAGATGATG 
SC-3RV 30mer 

AAGCAAGTCACGGAGTGTGAATAGTCAGAG 
SC-5 30mer 
70 ACACGAGACTGTTTGTTTGAAGACCCCTTG 
SC-6 25wer 

TGGAATCTCAACAAATATACAGGTG 
SN-3RV 30mer 

GGGTCATGGCCAACGTGGACGTATAAGCAC 
15 SN-4RV 30mer 

GATGATCACTGGCGCGGTTTTCTCCTCGAG 

Example 10 (Acquisition of Ratfinose Synthetase Gene from Japanese Artichoke cDNA) 

20 In the same manner as described in Example 5, cDNA was obtained from leaves of Japanese artichoke (Stachys 
sieboldii). Using this cDNA as a template and primers designed from the amino acid sequence of SEQ ID NO:1, i.e., 
primers having nucleotide sequences shown in list 10 below, a DNA fragment was amplified by PCR under the condi- 
tions described in Example 6. The DNA fragment thus amplified by PCR was cloned with a TA cloning kit (Invitrogen), 
followed by sequence reaction using ABI PRISM Dye Terminator Cycle Sequencing Ready Reaction Kit of Perkin-Elmer 

25 and nucleotide sequence analysis with a 373$ DNA sequencer of ABI. As a result, the nucleotide sequence of SEQ 
ID:6 was determined. 

Based on the nucleotide sequence thus obtained, synthesized DNA primers are prepared, and in the same manner 
as described in Example 9, the nucleotide sequences in both terminal regions of the gene are analyzed with Marathon 
Kitofaorrtech. 


(List 10) 


1- F primer 35mer 

CGATTiAAIGTITGGTGGACIACICAITGGGTIGG 
35 4-RV primer 37mer 

GGGCAGCIATIACICCITTICCITTIAAITGITTITT 

2- F primer 44mer 

CGAATIATIGAIAAITTIGGITGGTGIACITGGGAIGCITTITA 
6- RV primer 32mer 
40 GGCCACATITTIACIA(AG) ICCI ATIGG IGCI AA 

Example 1 1 (Acquisition of Raffinose Synthetase Gene from Corn cDNA) 

In the same manner as described in Example 5, cDNA was obtained from leaves of corn (Zea mays L.) Pioneer 
45 3358. Using this cDNA as a template and primers designed from the amino acid sequence of SEQ ID NO:1 , i.e., prim- 
ers having nucleotide sequences shown in list 1 1 below, a DNA fragment was amplified by PCR under the conditions 
described in Example 6. The DNA fragment thus amplified by PCR was cloned with a TA cloning kit (Invitrogen), fol- 
lowed by sequence reaction using ABI PRISM Dye Terminator Cycle Sequencing Ready Reaction Kit of PerWn-Elmer 
and nucleotide sequence analysis with a 373S DNA sequencer of ABI. Based on this sequence, primers having nucle- 
so otide sequences shown in list 12 below were synthesized. In the same manner as described in Example 5, mRNA 
obtained from leaves of corn (Zea mays L) Pioneer 3358 was linked to an adaptor contained in the Marathon Kit of 
Clontech with ligase. This operation was made according to the protocol attached. Using the adaptor-ligated cDNA thus 
prepared, polymerase chain reaction was effected in the same manner as described above with the primers shown in 
list 12 below. As a result, the nucleotide sequence of SEQ ID NO:8 was determined. 
$5 Based on the nucleotide sequence thus obtained, synthesized DNA primers are prepared, and in the same manner 
as described in Example 9, the nucleotide sequence in the 5-terminal region of the gene is analyzed with Marathon Kit 
of Clontech. 
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(Ust 11) 

5- F primer 41mer 

CQATGGATGGGIAAITTIAT1CAICCIGAITC3GQAIATGTT 

6- RV primer 32mer 

GGCCACATITTI AC! A( AG) ICCIATIGG IGC I AA 


(Ust 12) 

M-10 primer 25mer 

GACGTCGAGTGGAAGAGCGGCAAGG 
M- 11 primer 25mer 

CACCTACGAGCTCTTCGTCGTTGCC 


Example 12 (Construction of Expression Vectors in Plant for Chimera Gene, 35S-Broad Bean Raffinose Synthetase 
Gene) 

The plasmid pBluescriptKS-RS having the broad bean raffinose synthetase gene obtained in Example 7 was 
digested with the restriction endonucleases Bam HI and Sac I. Using Ligation Kit (Takara), the DNA fragment thus 
digested was cloned in the binary vector pBI1 21 (Clorrtech) previously digested with Bam HI and Sac I. The vector thus 
obtained was designated pBI121 -RS. 

For an antisense experiment, plasmid pBI121 (Clorrtech) previously digested with Bam HI and Sac I was ligated to 
linkers shown in list 13 below to give pBI121(-). This pBI121(-) was used to prepare pBI121(-)-RS in the same manner 
as described for the preparation of pBI121 -RS above. 

A similar vector was prepared with pBI221. The plasmid pBluescriptKS-RS obtained in Example 7 was digested 
with the restriction endonucleases Bam HI and Sac I. Using Ligation Kit (Takara), the DNA fragment thus digested was 
cloned in the vector pBI221 (Clorrtech) previously digested with Bam HI and Sac I. The vector thus obtained was des- 
ignated pBI221-RS. 

For an antisense experiment, plasmid pBI221 (Clorrtech) previously digested with Bam HI and Sac I was ligated to 
linkers shown in list 13 below to give pBI221(-). This pBI221(-) was used to prepare pBI221(-)-RS in the same manner 
as described for the preparation of pBI221-RS above. 

The construction of these expression vectors is shown in Figures 2 and 3. 


(Ust 13) 

BamSac-(+) linker 25mer 

GATCGAGCTCGTGTCGGATCCAGCT 
BamSac-(-) linker 17mer 

GGATCCGACACGAGCTC 


Example 13 (Transformation of Mustard with Broad Bean Raffinose Synthetase Gene) 

The vectors pBI121-RS and pBI121(-)-RS prepared in Example 12 were used for the transformation of mustard 
(Brassica juncia) by the Agrobacterium infection method. 

Agrobacterium tumefaciens (strain C58C1 , rifampicin resistant) previously made into a competent state by calcium 
chloride treatment was transformed independently with two pJasmids pBI121-RS and pBI121(-)-RS prepared in Exam- 
ple 12. Selection fortransformants was carried out on LB medium containing 50 ug/ml rifampicin and 25 ng/ml kanamy- 
cin by utilizing the character of kanamycin resistance conferred by the kanamycin resistance gene (neomycin 
phosphotransferase, NPTII) of the introduced plasmids. 

The transfbrmarrt Agrobacterium obtained (Agrobacterium tumefaciens strain C58, rifampicin resistant) was culti- 
vated on LB medium containing 50 jig/ml rifampicin and 25 ug/ml kanamycin at 28°C for a whole day and night, and the 
culture was used for the transformation of mustard by the method described below. 

The seeds of mustard were aseptically sowed on 1/2 MS medium, 2% sucrose, 0.7% agar. After one week, cotyle- 
dons and petioles of sprouting plants were cut out with a scalpel, and transferred to MS medium, 3% sucrose, 0.7% 
agar, 4.5 *iM BA, 0.05 jiM 2.4-D, 3.3 |iM AgN0 3 , followed by precultivation for 1 day. The precultivated cotyledons and 
petioles were transferred in a 1000-fold dilution of the Agrobacterium culture to cause infection for 5 minutes. The 
infected cotyledons and petioles were transferred again to the same medium as used in the precultivation, and culti- 
vated for 3 to 4 days. The cultivated cotyledons and petioles were transferred to MS medium, 3% sucrose, 4.5 \M BA, 
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0.05 |iM 2.4-D, 3.3 \ifJi AgN0 3 , 500 mg/l cefolaxim, and sterilized with shaking for 1 day. The sterilized cotyledons and 
petioles were transferred to MS medium, 3% sucrose, 0.7% agar, 4.5 uM BA, 0.05 yM 2.4-D, 3.3 uM AgN0 3 , 100 mg/l 
cefotaxim, 20 mg/l kanamycin, and cultivated for 3 to 4 weeks. The cotyledons and petioles were transferred to MS 
medium, 3% sucrose, 0.7% agar, 4.5 jiM BA, 0.05 *iM 2.4-D, 100 mg/I cefotaxim, 20 mg/l kanamycin, and cultivated. 
The cultivation on this medium was continued with subculturing at intervals of 3 to 4 weeks. When the regeneration of 
shoots began to occur, these shoots were subcultured on MS medium, 3% sucrose, 0.7% agar, 20 mg/l kanamycin, and 
cultivated for 3 to 4 weeks. The rooting plants were transferred to vermiculite : peat moss =1:1, and conditioned at 21 ° 
to 22°C in a cycle of day/night =12 hours : 12 hours. With the progress of plant body growth, the plants were suitably 
grown with cultivation soil. From leaves of the regenerated plants, genomic DNA was extracted according to the method 
described above, and the gene insertion into the plant genome was confirmed by PCR using the primers shown in list 
14 below. 

(List 14) 

35S 30mer 

TTCCAGTATGGACGATTCAAGGCTTGCTTC 
NOS 25mer 

ATGTATAATTGCGGGACTCTAATCA 
RS-F 30mer 

AAGAGTGTATCTGAATTTTCACGCGCGGTG 
RS-RV 33mer 

ACCTTCCCATACACCTTTTGGATGAACCTTCAA 

Example 14 (Transformation of Soybean Somatic Embryo with Broad Bean Raffinose Synthetase Gene) 

Cultured cells of soybean "Fayette 1 * somatic embryos (400 to 500 mg FW) were arranged in one layer within a circle 
having a diameter of 20 mm on the central part of a 6 cm agar plate. Two plasmids pBI221 -RS and pBI221 (-)-RS having 
chimera genes prepared from the broad bean raffinose synthetase gene and 35S promoter in Example 12 were intro- 
duced into the soybean somatic embryos according to the disclosure of the Japanese Patent Application No. 3- 
291501/1991. That is, these plasmids were mixed with the p-glucuronidase (GUS)/hygromycin-resistant gene (HPT) 
coexpression vector pSUM-GH:Notl for selection described in Soshiki Baiyo, 20, 323-327 (1994). These mixed plas- 
mids were used for the gene introduction into the soybean somatic embryos with a particle gun (800 mg/coating gold 
particles 200 ng/shot; projectile stopper-sample distance, 1 00 mm). After the introduction, gyratory cultures were grown 
in the MS modified growth liquid medium (Sigma) containing 25 to 50 jug/ml hygromycin under illumination at 25°C for 
16 hours, and transformed somatic embryos were selected. 

For the hygromycin-resistant soybean somatic embryos having yellowish green color and growth ability, which were 
selected after about 3 months, polymerization chain reaction is effected with primers shown in list 14 above to deter- 
mine whether the broad bean raffinose synthetase gene region is amplified or not. This confirms that the broad bean 
raffinose synthetase gene is inserted into the soybean genome. 

Furthermore, the somatic embryos obtained are used for the regeneration of plants to give transformant soybean 
with the broad bean raffinose synthetase gene. 

Medium Composition 

LB and MS media used in the above Examples have the following respective compositions. 


(LB medium) 


Bacto-tryptone 
Bacto-yeast extract 
NaCI 


10 g 

5g 

10g/1 liter H 2 0(pH 7.0) 
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(MS medium) 

KN0 3 

2022 mg/l 

NH4NO3 

1650 mg/l 

NH4CI 

2140 mg/l 

KH 2 P0 4 

170 mg/l 

MgS04 • 7H 2 0 

370 mg/l 

CaCI 2 * 2H 2 0 

440 mg/l 

MnS0 4 • 4H 2 0 

22.3 mg/l 

ZnS0 4 • 7H 2 0 

8.6 mg/l 

CuS0 4 • 5H 2 0 

0.025 mg/l 

Kl 

0.83 mg/l 

U0OI 2 • oM 2 L/ 

t\ HOC rwv/1 

u.u<£o mg/i 

H 3 B0 3 

6.2 mg/l 

NaMo04 • 2H 2 0 

0.25 mg/l 

FeS0 4 • 7H 2 0 

27.8 mg/l 

Na 2 EDTA 

37.3 mg/l 

Nicotinic add 

0.5 mg/l 

Thiamine HCI 

1 mg/l 

Pyridoxine HCI 

0.5 mg/l 

Inositol 

100 mg/l 

Glycine 

2 mg/l 


35 

Brief Description of the Sequences 
1.SEQ ID NO:1: 

40 The sequence of SEQ ID NO:1 shows an amino acid sequence of a raffinose synthetase protein encoded in the 
raff inose synthetase gene obtained from broad bean. 

2. SEQ ID NO:2: 

45 The sequence of SEQ ID NO:2 shows a cDNA nucleotide sequence of the raffinose synthetase gene obtained from 
broad bean. 

3. SEQ ID NO:3: 

so The sequence of SEQ ID NO:3 shows an amino acid sequence of a raffinose synthetase protein encoded in the 
raffinose synthetase gene obtained from soybean. 

4. SEQ ID NO:4: 

55 The sequence of SEQ ID NO:4 shows a cDNA nucleotide sequence of the raffinose synthetase gene obtained from 
soybean. 
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5. SEQ ID NO:5: 


The sequenc of SEQ ID NO:5 shows an amino acid sequenc of a raffinose synthetase protein encoded in the 
raffinose synthetase gene obtained from Japanese artichoke. 

5 

6. SEQ ID NO:6: 


The sequence of SEQ ID NO:6 shows a cDNA nucleotide sequence of the raffinose synthetase gene obtained from 
Japanese artichoke. 

10 

7. SEQ ID NO:7: 


The sequence of SEQ ID NO:7 shows an amino acid sequence of a raffinose synthetase protein encoded in the 
raffinose synthetase gene obtained from corn. 

15 

8. SEQ ID NO:8: 


Hie sequence of SEQ ID NO:8 shows a cDNA nucleotide sequence of the raffinose synthetase gene obtained from 
corn. 

20 

9. LisM: 


The nucleotide sequences shown in list 1 are of the typical primers used in the amplification of a cDNA fragment of 
a raffinose synthetase gene. All of these sequences are based on the nucleotide sequence in the non-coding region of 

25 the gene. Primer 1 is a sense primer corresponding to the 5'-terminus of a cDNA fragment of the broad bean-derived 
raffinose synthetase gene. Primers 2 and 3 are antisense primers corresponding to the 3Merminus of the cDNA frag- 
ment of the broad bean-derived raffinose synthetase gene. Primer 4 is a sense primer corresponding to the S'-terminus 
of a cDNA fragment of the soybean-derived raffinose synthetase gene. Primers 5 and 6 are antisense primers corre- 
sponding to the 3'-terminus of the cDNA fragment of the soybean-derived raffinose synthetase gene. Depending upon 

30 the purpose, recognition sequences for suitable restriction endonucleases can be added to the S'-termini of these 
nucleotide sequences in an appropriate manner. 


10. List 2: 


35 The nucleotide sequences shown in list 2 are of the typical primers used in the amplification of an open reading 
frame coding for the amino acid sequence of a raffinose synthetase protein in the cDNA sequence of a raffinose syn- 
thetase gene. Primers 1 and 2 are sense primers corresponding to the N-terminus of the broad bean-derived raffinose 
synthetase protein. Primers 3 and 4 are antisense primers corresponding to the C-terminus of the broad bean-derived 
raffinose synthetase protein. Primers 5 and 6 are sense primers corresponding to the N-terminus of the soybean- 

40 derived raffinose synthetase protein. Primers 7 and 8 are antisense primers corresponding to the C-terminus of the soy- 
bean-derived raffinose synthetase protein. Depending upon the purpose, recognition sequences for suitable restriction 
endonucleases can be added to the 5' termini of these sequences in an appropriate manner. 


45 


11. List 3: 

The amino acid sequences shown in list 3 are partial amino acid sequences of a raffinose synthetase protein. 


#1 is equivalent to the partial amino acid sequence extending from amino acid 1 10 to amino acid 129 in the amino 
acid sequence of SEQ ID NO:1 . 
so #2 is equivalent to the partial amino acid sequence extending from amino acid 234 to amino acid 247 in the amino 
acid sequence of SEQ ID NO:1. 

#3 is equivalent to the partial amino acid sequence extending from amino acid 265 to amino acid 279 in the amino 
acid sequence of SEQ ID NO:1 . 

#4 is equivalent to the partial amino acid sequence extending from amino acid 296 to amino acid 312 in the amino 
55 acid sequence of SEQ ID NO:1 . 

#5 is equivalent to the partial amino acid sequence extending from amino acid 346 to amino acid 361 in the amino 
acid sequenc of SEQ ID NO:1 . 

#6 is equivalent to the partial amino acid sequence extending from amino acid 383 to amino acid 402 in the amino 
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acid sequence of SEQ ID NO:1. 

#7 is equivalent to the partial amino acid sequence extending from amino acid 41 1 to amino acid 433 
acid sequence of SEQ ID NO:1. 

#8 is equivalent to the partial amino acid sequence extending from amino acid 440 to amino acid 453 
s acid sequence of SEQ ID NO:1 . 

#9 is equivalent to the partial amino acid sequence extending from amino acid 457 to amino acid 468 
acid sequence of SEQ ID NO:1 . 

#10 is equivalent to the partial amino acid sequence extending from amino acid 471 to amino acid 516 
acid sequence of SEQ ID NO:1. 
io #1 1 is equivalent to the partial amino acid sequence extending from amino acid 51 7 to amino acid 559 
acid sequence of SEQ ID NO:1. 

#12 is equivalent to the partial amino acid sequence extending from amino acid 574 to amino acid 582 
acid sequence of SEQ ID NO:1 . 

#13 is equivalent to the partial amino acid sequence extending from amino acid 586 to amino acid 609 
is acid sequence of SEQ ID NO:1 . 

#1 4 is equivalent to the partial amino acid sequence extending from amino acid 615 to amino acid 627 
acid sequence of SEQ ID NO:1. 

#1 5 is equivalent to the partial amino acid sequence extending from amino acid 71 6 to amino acid 724 
acid sequence of SEQ ID NO:1 . 

20 

12. List 4: 

The nucleotide sequences shown in list 4 are of the typical primers synthesized on some of the amino acid 
sequences shown in list 3. The symbol "F" as used after the primer number means that the primer referred to by this 

25 symbol has a sense sequence. The symbol "RV as used after the primer number means that the primer referred to by 
this symbol has an antisense sequence. Primer 1 corresponds to the partial amino acid sequence extending from 
amino acid 119 to amino acid 129 in the amino acid sequence of SEQ ID NO:1. Primer 2 corresponds to the partial 
amino acid sequence extending from amino acid 234 to amino acid 247 in the amino acid sequence of SEQ ID NO:1. 
Primer 3 corresponds to the partial amino acid sequence extending from amino acid 265 to amino acid 279 in the amino 

30 acid sequence of SEQ ID NO:1. Primer 4 corresponds to the partial amino acid sequence extending from amino acid 
458 to amino acid 468 in the amino acid sequence of SEQ ID NO:1. Primer 5 corresponds to the partial amino acid 
sequence extending from amino acid 522 to amino acid 534 in the amino acid sequence of SEQ ID NO: 1 . Primer 6 cor- 
responds to the partial amino acid sequence extending from amino acid 716 to amino acid 724 in the amino acid 
s quence of SEQ ID NO:1 . 

35 

13. List 5: 

The nucleotide sequences shown in list 5 are of the typical primers synthesized on the partial amino acid 
sequences of the purified broad bean raff inose synthetase protein. The bases shown in parentheses mean that a mix- 
40 ture of those bases was used in the synthesis. The symbol "RV" as used after the primer number means that the primer 
referred to by this symbol has an antisense sequence. 

14. List 6: 

45 The nucleotide sequences shown in list 6 are of the typical primers used in the analysis of both terminal regions of 
a cDNA nucleotide sequence of the broad bean raffinose synthetase gene by the RACE method. The symbol "RV" as 
used after the primer number means that the primer referred to by this symbol has an antisense sequence. 

15. List 7: 

50 

The nucleotide sequences shown in list 7 are of the typical primers used in the cloning of the broad bean raffinose 
synthetase gene. RS-N corresponds to the N-terminus of the open reading frame and contains two recognition sites for 
the restriction endonucleases Bam HI and Nco I on the SMerminal side. RS-C is an antisense primer corresponding to 
the C-terminus of the open reading frame and contains a recognition site for the restriction endonuclease Xba I on the 
55 5'-terminal side. 
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16. List 8: 

The nucleotide sequences shown in list 8 are of the typical primers used in the cloning of a soybean raff inose syn- 
thetase gene fragment. The base represented by the symbol T was inosine used in the synthesis. The symbol n RV" as 
5 used after the primer number means that the primer referred to by this symbol has an antisense sequence. 

17. List 9: 

The nucleotide sequences shown in list 9 are of the typical primers used in the analysis of the cDNA nucleotide 
10 sequence of a soybean raff inose synthetase gene fragment. The symbol "RV as used after the primer number means 
that the primer referred to by this symbol has an antisense sequence. 

The analysis of nucleotide sequences was carried out by polymerase chain reaction using SN-1 and SC-3RV. SC- 
5 and SC-6 were used in the analysis of a nucleotide sequence in the 3*-terminal region, and SN-3RV and SN-4RV were 
used in the analysis of a nucleotide sequence in the 5'-terminal region. 

15 

18. List 10: 

The nucleotide sequences shown in list 10 are of the typical primers used in the analysis of the cDNA nucleotide 
sequence of a Japanese artichoke raffinose synthetase gene fragment. The base represented by the symbol T was 
20 inosine used in the synthesis. The symbol "RV" as used after the primer number means that the primer referred to by 
this symbol has an antisense sequence. 

19. List 11: 

25 The nucleotide sequences shown in list 1 1 are of the typical primers used in the analysis of the cDNA nucleotide 
sequence of a corn raffinose synthetase gene fragment. The base represented by the symbol T was inosine used in 
the synthesis. The symbol "RV" as used after the primer number means that the primer referred to by this symbol has 
an antisense sequence. 

30 20. List 12: 

The nucleotide sequences shown in list 12 are of the typical primers used in the analysis of the cDNA nucleotide 
sequence of a corn raffinose synthetase gene fragment. M-10 and M-11 were used in the analysis of a nucleotide 
sequence in the 3'-terminal region, 

35 

21. List 13: 

The nucleotide sequences shown in list 13 are of the typical adopters used in the construction of vectors for anti- 
sense experiments. These synthetic DNA fragments takes a double-stranded form when mixed together because they 
40 are complementary strands. This double-stranded DNA fragment has cohesive ends of cleavage sites for the restriction 
endonucleases Bam HI and Sac I on both termini, and contains the restriction sites for the restriction endonucleases 
Bam HI and Sac I in the double-stranded region. 

22. List 14: 

45 

The nucleotide sequences shown in list 14 are of the typical primers used in the PCR experiment to confirm the 
gene introduction into the genome of a recombinant plant. 35S is a primer toward the downstream region at the 35S 
promoter site, and NOS is a primer toward the upstream region at the NOS terminator she. RS-F is a sense primer of 
the broad bean raffinose synthetase gene, and RS-RV is an antisense primer of the broad bean raffinose synthetase 
so gene. 


55 
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(1) GENERAL INFORMATION: 

(i) APPLICANT; 

<A) NAME : Sumitomo Chemical Co., Ltd. 

(B) STREET: 5-33, Kitahama 4-chome, Chuo-ku 

(C) CITY: Osaka- shi, Osaka™ fu 

(E) COUNTRY: Japan 

(F) POSTAL CODE (ZIP) : none 

(ii) TITLE OF INVENTION: Raffinose synthetase genes and the 

use thereof 

(iii) NUMBER OF SEQUENCES: 8 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER : IBM PC compatible 

<C) OPERATING SYSTEM: PS -DOS/MS -DOS 

<D) SOFTWARE: Patent In Release #1.0, Version #1.25(EPO) 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 97 12 2417.5 

(B) FILING DATE : 18-DEC-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP 338673/1996 

(B) FILING DATE: 18-DEC-1996 


(1} INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 


(vi) ORIGINAL SOURCE: 

(A) ORGANISM; broad bean (Vicia faba) 

(B) STRAIN: Nintoku Issun 
(F) TISSUE TYPE: seeds 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 


Met Ala Pro Pro Ser lie Thr Lys 

1 5 
Ser Thr lie Asp lie Gly Asn Gly 
20 

Leu Asp Gin Ser Arg Asp Phe Leu 
35 40 
Gin Val Pro Pro Asn lie Thr Thr 

50 55 
Phe Leu Asn Leu Lys Ser Asn Lys 
65 70 


Thr Ala Thr Leu Gin Asp Val lie 

10 15 
Asn Ser Pro Leu Phe Ser lie Thr 

25 30 
Ala Asn Gly His Pro Phe Leu Thr 
45 

Thr Thr Thr Thr Thr Ala Ser Ser 
60 

Asp Thr lie Pro Asn Asn Asn Asn 
75 80 
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Thr 

Met 

Leu 

Leu 

Gin 
85 

Gin 

Gly 

Cys 

Phe 

Val 
90 

Gly Phe 

Asn 

Ser 

Thr 
95 

Glu 


Pro 

Lys 

Ser 

His 

His 

Val 

Val 

Pro 

Leu Gly 

Lys Leu 

Lys 

Gly 

He 

Lys 

5 



100 





105 




110 




Phe 

Met 

Ser 
115 

He 

Phe 

Arg 

Phe 

Lys 
120 

Val 

Trp 

Trp Thr 

Thr 
125 

His 

Trp 

Val 


Gly 

Thr 

Asn 

Gly 

Gin 

Glu 

Leu 

Gin 

His 

Glu 

Thr Gin 

Met 

Leu 

He 

Leu 


130 





135 




140 





10 

Asp 

Lys 

Asn 

Asp 

Ser 

Leu 

Gly 

Arg 

Pro 

Tyr 

Val Leu 

Leu 

Leu 

Pro 

He 

145 





150 





155 




160 


Leu 

Glu 

Asn 

Thr 

Phe 
165 

Arg 

Thr 

Ser 

Leu 

Gin 
170 

Pro Gly 

Leu 

Asn 

Asp 
175 

His 


He 

Gly 

Met 

Ser 

Val 

Glu 

Ser 

Gly 

Ser 

Thr 

His Val 

Thr 

Gly 

Ser 

Ser 




180 





185 




190 



15 

Phe 

Lys 

Ala 

Cys 

Leu 

Tyr 

He 

His 

Leu 

Ser 

Asn Asp 

Pro 

Tyr 

Ser 

He 



195 





200 




205 





Leu 

Lys 
210 

Glu 

Ala 

Val 

Lys 

val 
215 

He 

Gin 

Thr 

Gin Leu 
220 

Gly 

Thr 

Phe 

Lys 


Thr 

Leu 

Glu 

Glu 

Lys 

Thr 

Ala 

Pro 

Ser 

He 

He Asp 

Lys 

Phe 

Gly 

Trp 


225 





230 





235 




240 

20 

Cys 

Thr 

Trp 

Asp 

Ala 

Phe 

Tyr 

Leu 

Lys 

Val 

His Pro 

Lys 

Gly 

Val 

Trp 





245 





250 




255 



Glu 

Gly 

Val 

Lys 

Ser 

Leu 

Thr 

Asp 

Gly Gly 

Cys Pro 

Pro 

Gly 

Phe 

Val 





260 





265 




270 




He 

He 

Asp 

Asp 

Gly 

Trp 

Gin 

Ser 

He 

Cys 

His Asp 

Asp 

Asp 

Asp 

Glu 

25 



275 





280 




285 




Asp 

Asp 
290 

Ser 

Gly 

Met 

Asn 

Arg 
295 

Thr 

Ser 

Ala 

Gly Glu 
300 

Gin 

Met 

Pro 

Cys 


Arg 

Leu 

Val 

Lys 

Tyr 

Glu 

Glu 

Asn 

Ser 

Lys 

Phe Arg 

Glu 

Tyr 

Glu 

Asn 


30S 





310 





315 






Pro 

Glu 

Asn 

Gly 

Gly 

Lys 

Lys 

Gly 

Leu 

Gly 

Gly Phe 

Val 

Arg 

Asp 

Leu 

30 

320 




325 





330 




335 


Lys 

Glu 

Glu 

Phe 
340 

Gly 

Ser 

Val 

Glu 

Ser 
345 

Val 

Tyr Val 

Trp 

His 
350 

Ala 

Leu 


Cys 

Gly 

Tyr 
355 

Trp 

Gly 

Gly 

Val 

Arg 
360 

Pro 

Gly 

Val His 

Gly 
365 

Met 

Pro 

Lys 

35 

Ala 

Arg 
370 

Val 

Val 

Val 

Pro 

Lys 
375 

Val 

Ser 

Gin 

Gly Leu 
380 

Lys 

Met 

Thr 

Met 


Glu 

Asp 

Leu 

Ala 

Val 

Asp 

Lys 

He 

Val 

Glu 

Asn Gly 

Val 

Gly 

Leu 

val 


385 





390 





395 




400 


Pro 

Pro 

Asp 

Phe 

Ala 
405 

His 

Glu 

Met 

Phe 

Asp 
410 

Gly Leu 

His 

Ser 

His 
415 

Leu 

40 

Glu 

Ser 

Ala 

Gly 

He 

Asp 

Gly 

Val 

Lys 

Val 

Asp Val 

He 

His 

Leu 

Leu 




420 





425 




430 




Glu 

Leu 

Leu 

Ser 

Glu 

Glu 

Tyr 

Gly 

Gly Arg 

Val Glu 

Leu 

Ala 

Arg 

Ala 




435 





440 




445 





Tyr 

Tyr 
450 

Lys 

Ala 

Leu 

Thr 

Ser 
455 

Ser 

Val 

Lys 

Lys His 
460 

Phe 

Lys 

Gly 

Asn 

45 

Gly 

Val 

He 

Ala 

Ser 

Met 

Glu 

His 

Cys 

Asn 

Asp Phe 

Phe 

Leu 

Leu 

Gly 

465 





470 





475 




480 


Thr 

Glu 

Ala 

He 

Ser 
485 

Leu 

Gly 

Arg 

Val 

Gly 
490 

Asp Asp 

Phe 

Trp 

Cys 
495 

Ser 


Asp 

Pro 

Ser 

Gly 

Asp 

Pro 

Asn 

Gly 

Thr Tyr 

Trp Leu 

Gin 

Gly 

Cys 

His 





500 





505 




510 



50 

Met 

Val 

His 

Cys 

Ala 

Tyr 

Asn 

Ser 

Leu Trp 

Met Gly 

Asn 

Phe 

He 

Gin 




515 





520 




525 





Pro 

Asp 
530 

Trp 

Asp 

Met 

Phe 

Gin 
535 

Ser 

Thr 

His 

Pro Cys 
540 

Ala 

Glu 

Phe 

His 
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5 


15 


20 


Ala 

Ala 

Ser 

Arg 

Ala 

He 

Ser 

Gly Gly Pro He 

Tyr 

Val 

Ser 

Asp Cys 

545 





550 


555 




560 

val 

Gly 

Asn 

His 

Asn 

Phe 

Lys 

Leu Leu Lys Ser 

Leu 

Val 

Leu 

Pro Asp 





565 



570 




575 

Gly 

Ser 

He 

Leu 

Arg 

Cys 

Gin 

His Tyr Ala Leu 

Pro 

Thr 

Arg Asp Cys 




580 




585 



590 


Leu 

Phe 

Glu 

Asp 

Pro 

Leu 

His 

Asn Gly Lys Thr 

Met 

Leu 

Lys 

He Trp 



595 





600 


605 



Asn 

Leu 

Asn 

Lys 

Tyr 

Thr 

Gly 

Val Leu Gly Leu 

Phe 

Asn 

Cys 

Gin Gly 


610 





615 


620 




Gly 

Gly 

Trp 

Cys 

Pro 

Glu 

Ala 

Arg Arg Asn Lys 

Ser 

Val 

Ser 

Glu Phe 

625 





630 


635 




640 

Ser 

Arg 

Ala 

Val 

Thr 

Cys 

Tyr 

Ala Ser Pro Glu 

Asp 

Xle 

Glu 

Trp Cys 





645 



650 




655 

Asn 

Gly 

Lys 

Thr 

Pro 

Met 

Ser 

Thr Lys Gly Val 

Asp 

Phe 

Phe 

Ala Val 




660 




665 



670 


Tyr 

Phe 

Phe 

Lys 

Glu 

Lys 

Lys 

Leu Arg Leu Met 

Lys 

Cys 

Ser Asp Arg 



675 





680 


685 



Leu 

Lys 

Val 

Ser 

Leu 

Glu 

Pro 

Phe Ser Phe Glu 

Leu 

Met 

Thr 

Val Ser 


690 





6 95 


700 




Pro 

Val 

L Y 3 

Val 

Phe 

Ser 

Lys 

Arg Phe He Gin 

Phe 

Ala 

Pro 

He Gly 

705 





710 


715 




720 

Leu 

Val 

Asn 

Met 

Leu 

Asn 

Ser 

Gly Gly Ala He 

Gin 

Ser 

Leu 

Glu Phe 





725 



730 




735 

Asp 

Asp 

Asn 

Ala 

Ser 

Leu 

Val 

Lys He Gly Val 

Arg 

Gly 

Cys 

Gly Glu 




74 0 




745 



750 


Met 

Ser 

Val 

Phe 

Ala 

Ser 

Glu 

Lys Pro Val Cys 

Cys 

Lys 

He 

Asp Gly 



755 





760 


765 



Val 

Lys 

Val 

Lys 

Phe 

Leu 

Tyr 

Glu Asp Lys Met 

Ala 

Arg 

Val 

Gin He 


770 





775 


780 




Leu 

Trp 

Pro 

Ser 

Ser 

Ser 

Thr 

Leu Ser Leu Val 

Gin 

Phe 

Leu 

Phe Stop 

785 





790 


795 




800 


(1) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 2746 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 


40 


45 


(ii) MOLECULE TYPE: cDNA to mRNA 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 101.. 2500 

(C) IDENTIFICATION METHOD: by experiment 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 


AATTTTCAAG CATAGCCAAG TTAACCACCT TAGAAACATT CCTACAAGCT ACTTATCCCT 60 
GTCAATAAGC TACTAAGCTA CCAGAGTCTC ATCAATCACC ATG GCA CCA CCA AGC 115 

Met Ala Pro Pro Ser 

50 5 

ATA ACC AAA ACT GCA ACC CTC CAA GAC GTA ATA AGC ACC ATC GAT ATT 163 
He Thr Lys Thr Ala Thr Leu Gin Asp Val He Ser Thr He Asp He 
10 15 20 
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w 


20 


GGT AAT GGT AAC TCA CCC TTA TTC TCC ATA ACC TTA GAC CAA TCA CGT 211 
Gly Asn Gly Asn Ser Pro Leu Phe Ser lie Thr Leu Asp Gin Ser Arg 

25 30 35 

GAC TTC CTT GCA AAT GGC CAC CCT TTC CTC ACC CAA GTC CCA CCT AAC 259 
Asp Phe Leu Ala Asn Gly His Pro Phe Leu Thr Gin Val Pro Pro Asn 

40 45 50 

ATA ACA ACA ACA ACA ACA ACC ACT GCT TCC TCT TTT CTC AAT CTC AAA 307 
lie Thr Thr Thr Thr Thr Thr Thr Ala Ser Ser Phe Leu Asn Leu Lys 

55 60 65 

TCC AAC AAA GAT ACC ATT CCC AAC AAC AAC AAC ACC ATG TTG TTG CAA 355 
Ser Asn Lys Asp Thr lie Pro Asn Asn Asn Asn Thr Met Leu Leu Gin 
70 75 80 85 

CAA GGT TGT TTC GTT GGT TTC AAC TCC ACC GAA CCC AAA AGC CAC CAC 403 
Gin Gly Cys Phe Val Gly Phe Asn Ser Thr Glu Pro Lys Ser His His 

90 95 100 

GTA GTT CCA CTC GGC AAA CTA AAA GGA ATC AAA TTC ATG AGC ATA TTC 451 
Val Val Pro Leu Gly Lys Leu Lys Gly He Lys Phe Met Ser He Phe 

105 110 115 

CGG TTC AAA GTT TGG TGG ACA ACT CAC TGG GTC GGA ACA AAT GGA CAG 499 
Arg Phe Lys Val Trp Trp Thr Thr His Trp Val Gly Thr Asn Gly Gin 

120 125 130 

GAA CTA CAA CAC GAA ACA CAA ATG TTA ATC CTG GAC AAA AAC GAC TCC 547 
Glu Leu Gin His Glu Thr Gin Met Leu He Leu Asp Lys Asn Asp Ser 

135 140 145 

CTC GGA CGA CCC TAT GTC TTA CTC CTC CCA ATC CTA GAA AAC ACC TTC 595 
Leu Gly Arg Pro Tyr Val Leu Leu Leu Pro He Leu Glu Asn Thr Phe 
150 155 160 165 

CGA ACC TCA CTC CAA CCC GGT CTC AAC GAT CAC ATA GGC ATG TCC GTC 643 
Arg Thr Ser Leu Gin Pro Gly Leu Asn Asp His He Gly Met Ser Val 

170 175 180 

GAA AGC GGT TCA ACA CAT GTC ACC GGG TCA AGC TTC AAA GCA TGT CTT 691 
Glu Ser Gly Ser Thr His Val Thr Gly Ser Ser Phe Lys Ala Cys Leu 

185 190 195 

TAC ATC CAT CTC AGT AAC GAC CCA TAG AGT ATA CTA AAA GAA GCA GTT 739 
Tyr He His Leu Ser Asn Asp Pro Tyr Ser He Leu Lys Glu Ala Val 

200 205 210 

AAA GTA ATC CAA ACT CAG TTA GGA ACA TTC AAG ACT CTT GAA GAA AAA 787 
Lys Val He Gin Thr Gin Leu Gly Thr Phe Lys Thr Leu Glu Glu Lys 

215 220 225 

ACA GCA CCT AGT ATT ATA GAC AAA TTC GGT TGG TGC ACG TGG GAT GCT 835 
Thr Ala Pro Ser He He Asp Lys Phe Gly Trp Cys Thr Trp Asp Ala 
230 235 240 245 

40 TTT TAC TTG AAG GTT CAT CCA AAA GGT GTA TGG GAA GGT GTA AAG TCT 883 

Phe Tyr Leu Lys Val His Pro Lys Gly Val Trp Glu Gly Val Lys Ser 

250 255 260 

CTC ACA GAT GGT GGT TGT CCT CCC GGT TTC GTC ATA ATC GAC GAC GGT 931 
Leu Thr Asp Gly Gly Cys Pro Pro Gly Phe Val He He Asp Asp Gly 

265 270 275 

TGG CAA TCC ATT TGT CAT GAC GAT GAC GAT GAA GAT GAT TCA GGA ATG 979 
Trp Gin Ser He Cys His Asp Asp Asp Asp Glu Asp Asp Ser Gly Met 

280 285 290 

AAC CGA ACC TCA GCC GGG GAA CAA ATG CCA TGC AGA CTT GTA AAA TAC 1027 
Asn Arg Thr Ser Ala Gly Glu Gin Met Pro Cys Arg Leu Val Lys Tyr 
50 295 300 305 

GAA GAG AAT TCT AAG TTT AGA GAA TAT GAG AAT CCT GAA AAT GGA GGG 1075 
Glu Glu Asn Ser Lys Phe Arg Glu Tyr Glu Asn Pro Glu Asn Gly Gly 
310 315 320 325 


35 


45 
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10 


20 


AAG AAA GGT TTG GGT GGT TTT GTG AGG GAT TTG AAG GAA GAG TTT GGG 1123 
Lys Lys Gly Leu Gly Gly Phe Val Arg Asp Leu Lys Glu Glu Phe Gly 

330 335 340 

AGT GTG GAG AGT GTT TAT GTT TGG CAT GCG CTT TGT GGG TAT TGG GGC 1171 
Ser Val Glu Ser Val Tyr Val Trp His Ala Leu Cys Gly Tyr Trp Gly 

345 350 355 

GGG GTT AGG CCT GGA GTG CAT GGG ATG CCG AAA GCT AGG GTT GTT GTT 1219 
Gly Val Arg Pro Gly Val His Gly Met Pro Lys Ala Arg Val Val Val 

360 365 370 

CCG AAG GTG TCT CAG GGG TTG AAG ATG ACG ATG GAG GAT TTG GCG GTG 1267 
Pro Lys Val Ser Gin Gly Leu Lys Met Thr Met Glu Asp Leu Ala Val 

375 380 385 

GAT AAG ATT GTT GAG AAC GGT GTG GGG CTA GTG CCG CCA GAT TTT GGA 1315 
Asp Lys He val Glu Asn Gly Val Gly Leu Val Pro Pro Asp Phe Ala 
390 395 400 405 

CAT GAG ATG TTT GAT GGG CTT CAC TCT CAT TTG GAG TCG GCG GGA ATT 1363 
His Glu Met Phe Asp Gly Leu His Ser His Leu Glu Ser Ala Gly lie 

410 415 420 

GAC GGT GTT AAA GTT GAT GTT ATC CAT CTG CTT GAG TTA CTA TCA GAG 1411 
Asp Gly Val Lys Val Asp Val He His Leu Leu Glu Leu Leu Ser Glu 

425 430 435 

GAA TAT GGT GGA CGA GTT GAG CTA GCA AGA GCT TAT TAC AAA GCA CTA 1459 
Glu Tyr Gly Gly Arg Val Glu Leu Ala Arg Ala Tyr Tyr Lys Ala Leu 

440 445 450 

ACC TCA TCA GTG AAG AAA CAT TTC AAA GGC AAT GGT GTA ATT GCT AGC 1507 
25 Thr Ser Ser Val Lys Lys His Phe Lys Gly Asn Gly Val He Ala Ser 

455 460 465 

ATG GAG CAT TGC AAC GAC TTC TTT CTC CTC GGC ACC GAA GCC ATA TCC 1555 
Met Glu His Cys Asn Asp Phe Phe Leu Leu Gly Thr Glu Ala He Ser 
470 475 480 485 

CTC GGC CGC GTC GGA GAT GAT TTT TGG TGC TCT GAT CCA TCT GGT GAT 1603 
30 Leu Gly Arg Val Gly Asp Asp Phe Trp Cys Ser Asp Pro Ser Gly Asp 

490 495 500 

CCA AAT GGT ACA TAT TGG CTC CAA GGT TGT CAC ATG GTA CAT TGT GCC 1651 
Pro Asn Gly Thr Tyr Trp Leu Gin Gly Cys His Met Val His Cys Ala 

505 510 515 

TAC AAC AGT TTA TGG ATG GGA AAT TTC ATT CAG CCA GAT TGG GAC ATG 1699 
Tyr Asn Ser Leu Trp Met Gly Asn Phe He Gin Pro Asp Trp Asp Met 

520 525 530 

TTT CAG TCC ACT CAT CCT TGT GCT GAA TTT CAT GCC GCC TCA CGA GCC 1747 
Phe Gin Ser Thr His Pro Cys Ala Glu Phe His Ala Ala Ser Arg Ala 
535 540 545 

40 ATA TCC GGC GGA CCA ATT TAT GTT AGT GAT TGT GTT GGT AAT CAC AAT 1795 

He Ser Gly Gly Pro He Tyr Val Ser Asp Cys Val Gly Asn His Asn 
550 555 560 565 

TTC AAG TTG CTC AAA TCT CTT GTT TTG CCC GAT GGT TCT ATC TTG CGT 1843 
Phe Lys Leu Leu Lys Ser Leu Val Leu Pro Asp Gly Ser He Leu Arg 
570 575 580 

45 TGT CAA CAT TAC GCA CTC CCT ACA AGA GAT TGC TTG TTT GAA GAC CCT 1891 

Cys Gin His Tyr Ala Leu Pro Thr Arg Asp Cys Leu Phe Glu Asp Pro 

585 590 595 

TTG CAT AAT GGC AAA ACA ATG CTG AAA ATT TGG AAT CTC AAC AAA TAT 1939 
Leu His Asn Gly Lys Thr Met Leu Lys He Trp Asn Leu Asn Lys Tyr 

600 605 610 

ACA GGT GTT TTG GGT CTT TTC AAC TGC CAA GGT GGT GGG TGG TGT CCT 1987 
Thr Gly Val Leu Gly Leu Phe Asn Cys Gin Gly Gly Gly Trp Cys Pro 
615 620 625 
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GAG GCA CGG CGA AAC AAG AGT GTA TCT GAA TTT TCA CGC GCG GTG ACA 2035 

Glu Ala Arg Arg Asn Lys Ser Val Ser Glu Phe Ser Arg Ala Val Thr 

630 635 640 645 

TGT TAT GCA AGT CCC GAA GAC ATT GAA TGG TGC AAT GGG AAA ACT CCA 2083 

Cys Tyr Ala Ser Pro Glu Asp He Glu Trp Cys Asn Gly Lys Thr Pro 

650 655 660 

ATG AGC ACC AAA GGT GTG GAT TTT TTT GCT GTG TAT TTT TTC AAG GAG 2131 
Met Ser Thr Lys Gly Val Asp Phe Phe Ala Val Tyr Phe Phe Lys Glu 

665 670 675 

AAG AAA TTG AGG CTC ATG AAG TGT TCT GAT AGA TTG AAA GTT TCG CTT 2179 
Lys Lys Leu Arg Leu Met Lys Cys Ser Asp Arg Leu Lys Val Ser Leu 

680 685 690 

GAG CCA TTT AGT TTT GAG CTA ATG ACA GTG TCT CCA GTG AAA GTG TTT 2227 
Glu Pro Phe Ser Phe Glu Leu Met Thr Val Ser Pro Val Lys Val Phe 

695 700 705 

TCG AAA AGG TTT ATA CAG TTT GCA CCG ATT GGG TTA GTG AAC ATG CTG 2275 
Ser Lys Arg Phe He Gin Phe Ala Pro He Gly Leu Val Asn Met Leu 
710 715 720 725 

AAC TCT GGT GGT GCG ATT CAG TCT CTG GAG TTT GAT GAT AAT GCA AGT 2323 
Asn Ser Gly Gly Ala He Gin Ser Leu Glu Phe Asp Asp Asn Ala Ser 

730 735 740 

TTG GTC AAG ATT GGG GTG AGA GGT TGC GGG GAG ATG AGC GTG TTT GCG 2371 
Leu Val Lys He Gly Val Arg Gly Cys Gly Glu Met Ser Val Phe Ala 

745 750 755 

TCT GAG AAA CCG GTT TGC TGC AAA ATT GAT GGG GTT AAG GTG AAA TTT 2419 
Ser Glu Lys Pro Val Cys Cys Lys He Asp Gly Val Lys Val Lys Phe 

760 765 770 

CTT TAT GAG GAC AAA ATG GCA AGA GTT CAA ATT CTG TGG CCT AGT TCT 2467 
Leu Tyr Glu Asp Lys Met Ala Arg Val Gin He Leu Trp Pro Ser Ser 

775 780 785 

TCA ACA TTG TCT TTG GTC CAG TTT TTA TTT TGA TCC CTAGGAA TCCTATGCAC 2520 
Ser Thr Leu Ser Leu Val Gin Phe Leu Phe Stop 
790 795 800 

GTGTCTCTGT TTACAAGTAC TTTATATAAG TATAATATGT ATCTATTTCC ATTTTTAACT 2580 
GTCTTTATGC AATTAGGTGG TCAATTAGTT ATTTGTTTGT GAAGTAACTA ACTTGCTTGT 2640 
GTTGTAAGCT TATAATATAT GGTCAAGTTC CTCACTTGTA TATACCTGTT GTATGTATAA 2700 
ATTTTACTAT ATATGACTAA CATCATTATC TTG TG AG CAA AAAAAA 2746 


(1) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 781 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; peptide 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: soybean (Glycine max) 

(B) STRAIN: Williams 82 
<F) TISSUE TYPE: seeds and leaves 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

50 Met Ala Pro Ser He Ser Lys Thr Val Glu Leu Asn Ser Phe Gly Leu 

5 10 15 

Val Asn Gly Asn Leu Pro Leu Ser He Thr Leu Glu Gly Ser Asn Phe 
20 25 30 
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(1) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2598 base pairs 
CB) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(ix) FEATURE: 

(A) NAME /KEY : peptide 

(B) LOCATION: 62.. 2407 

(C) IDENTIFICATION METHOD: by experiment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

CCAAACCATA GCAAACCTAA GCACCAAACC TCTTTCTTTC AAGATCCTTG AATTCAGTCC 
C ATG GCT CCA AGC ATA AGC AAA ACT GTG GAA CTA AAT TCA TTT GGT 
Met Ala Pro Ser He Ser Lys Thr Val Glu Leu Asn Ser Phe Gly 
5 10 15 
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CTT GTC AAC GGT AAT TTG CCT TTG TCC ATA ACC CTA GAA GGA TCA AAT 154 
Leu Val Asn Gly Asn Leu Pro Leu Ser lie Thr Leu Glu Gly Ser Asn 

20 25 30 

TTC CTC GCC AAC GGC CAC CCT TIT CTC ACG GAA GTT CCC GAA AAC ATA 2 02 

Phe Leu Ala Asn Gly His Pro Phe Leu Thr Glu Val Pro Glu Asn lie 

35 40 45 

ATA GTC ACC CCT TCA CCC ATC GAC GCC AAG AGT AGT AAG AAC AAC GAG 250 
lie Val Thr Pro Ser Pro lie Asp Ala Lys Ser Ser Lys Asn Asn Glu 

50 55 60 

GAC GAC GAC GTC GTA GGT TGC TTC GTG GGC TTC CAC GCG GAC GAG CCC 298 
Asp Asp Asp Val Val Gly Cys Phe Val Gly Phe His Ala Asp Glu Pro 

65 70 75 

AGA AGC CGA CAC GTG GCT TCC CTG GGG AAG CTC AGA GGA ATA AAA TTC 346 
Arg Ser Arg His Val Ala Ser Leu Gly Lys Leu Arg Gly lie Lys Phe 
80 85 90 95 

ATG AGC ATA TTC CGG TTT AAG GTG TGG TGG ACC ACT CAC TGG GTC GGT 394 
Met Ser lie Phe Arg Phe Lys Val Trp Trp Thr Thr His Trp Val Gly 

100 105 110 

AGC AAC GGA CAC GAA CTG GAG CAC GAG ACA CAG ATG ATG CTT CTC GAC 442 
Ser Asn Gly His Glu Leu Glu His Glu Thr Gin Met Met Leu Leu Asp 

115 120 125 

AAA AAC GAC CAG CTC GGA CGC CCC TTT GTG TTG ATT CTC CCG ATC CTC 4 90 

Lys Asn Asp Gin Leu Gly Arg Pro Phe Val Leu lie Leu Pro lie Leu 

130 135 140 

CAA GCC TCG TTC CGA GCC TCC CTG CAA CCC GGT TTG GAT GAT TAC GTG 538 
Gin Ala Ser Phe Arg Ala Ser Leu Gin Pro Gly Leu Asp Asp Tyr Val 

145 150 155 

GAC GTT TGC ATG GAG AGC GGG TCG ACA CGT GTC TGT GGC TCC AGC TTC 586 
Asp Val Cys Met Glu Ser Gly Ser Thr Arg Val Cys Gly Ser Ser Phe 
160 165 170 175 

GGG AGC TGC TTA TAC GTC CAC GTT GGC CAT GAC CCG TAT CAG TTG CTT 634 
Gly Ser Cys Leu Tyr Val His Val Gly His Asp Pro Tyr Gin Leu Leu 

180 185 190 

AGA GAA GCA ACT AAA GTC GTT AGG ATG CAT TTG GGG ACG TTC AAG CTT 682 
Arg Glu Ala Thr Lys Val val Arg Met His Leu Gly Thr Phe Lys Leu 

195 200 205 

CTC GAG GAG AAA ACC GCG CCA GTG ATC ATA GAC AAG TTT GGT TGG TGT 730 
Leu Glu Glu Lys Thr Ala Pro Val lie lie Asp Lys Phe Gly Trp Cys 

210 215 220 

ACA TGG GAC GCG TTT TAC TTG AAG GTG CAT CCC TCA GGT GTG TGG GAA 778 
Thr Trp Asp Ala Phe Tyr Leu Lys Val His Pro Ser Gly Val Trp Glu 
225 230 235 

40 GGG GTG AAA GGG TTG GTG GAG GGA GGG TGC CCT CCA GGG ATG GTC CTA 826 

Gly Val Lys Gly Leu Val Glu Gly Gly Cys Pro Pro Gly Met Val Leu 
240 245 250 255 

ATC GAC GAC GGG TGG CAA GCC ATT TGT CAC GAC GAG GAC CCC ATA ACG 874 
He Asp Asp Gly Trp Gin Ala He Cy s Hij s Asp G lu As P Pro Ile Tnr 

260 265 270 

GAC CAA GAG GGT ATG AAG CGA ACC TCC GCA GGG GAG CAA ATG CCA TGC 922 
Asp Gin Glu Gly Met Lys Arg Thr Ser Ala Gly Glu Gin Met Pro Cys 

275 280 285 

AGG TTG GTG AAG TTG GAG GAA AAT TAC AAG TTC AGA CAG TAT TGT AGT 970 
Arg Leu Val Lys Leu Glu Glu Asn Tyr Lys Phe Arg Gin Tyr Cys Ser 

290 295 300 

GGA AAG GAT TCT GAG AAG GGT ATG GGT GCC TTT GTT AGG GAC TTG AAG 1018 
Gly Lys Asp Ser Glu Lys Gly Met Gly Ala Phe Val Arg Asp Leu Lys 
305 310 315 
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GAA CAG TTT AGG AGC GTG GAG CAG GTG TAT GTG TGG CAC GCG CTT TGT 1066 
Glu Gin Phe Arg Ser Val Glu Gin Val Tyr Val Trp His Ala Leu Cys 
320 325 330 335 

5 GGG TAT TGG GGT GGG GTC AGA CCC AAG GTT CCG GGC ATG CCC CAG GCT 1114 

Gly Tyr Trp Gly Gly Val Arg Pro Lys Val Pro Gly Met Pro Gin Ala 

340 345 350 

AAG GTT GTC ACT CCG AAG CTG TCC AAT GGA CTA AAA TTG ACA ATG AAG 1162 
Lys Val Val Thr Pro Lys Leu Ser Asn Gly Leu Lys Leu Thr Met Lys 
355 360 365 

1° GAT TTA GCG GTG GAT AAG ATC GTC AGT AAC GGA GTT GGA CTG GTG CCA 1210 

Asp Leu Ala Val Asp Lys lie Val Ser Asn Gly Val Gly Leu Val Pro 

370 375 380 

CCA CAC CTG GCT CAC CTT TTG TAC GAG GGG CTC CAC TCC CGT TTG GAA 1258 
Pro His Leu Ala His Leu Leu Tyr Glu Gly Leu His Ser Arg Leu Glu 

385 390 395 

TCT GCG GGT ATT GAC GGT GTT AAG GTT GAC GTT ATA CAC TTG CTC GAG 1306 
Ser Ala Gly He Asp Gly Val Lys Val Asp Val He His Leu Leu Glu 
400 405 410 415 

ATG CTA TCC GAG GAA TAC GGT GGC CGT GTT GAG CTA GCC AAA GCT TAT 1354 
Met Leu Ser Glu Glu Tyr Gly Gly Arg Val Glu Leu Ala Lys Ala Tyr 

420 425 430 

TAC AAA GCG CTC ACT GCT TCG GTG AAG AAG CAT TTC AAA GGC AAT GGG 1402 
Tyr Lys Ala Leu Thr Ala Ser Val Lys Lys His Phe Lys Gly Asn Gly 

435 440 445 

GTC ATT GCG AGC ATG GAG CAT TGT AAT GAC TTC TTT CTC CTT GGT ACC 1450 
Val He Ala Ser Met Glu His Cys Asn Asp Phe Phe Leu Leu Gly Thr 

450 455 460 

GAA GCC ATA GCC CTT GGG CGC GTA GGA GAT GAT TTT TGG TGC ACT GAT 14 98 
Glu Ala lie Ala Leu Gly Arg Val Gly Asp Asp Phe Trp Cys Thr Asp 

465 470 475 

CCC TCT GGA GAT CCA AAT GGC ACG TAT TGG CTC CAA GGG TGT CAC ATG 1546 
30 pro Ser Gly Asp Pro Asn Gly Thr Tyr Trp Leu Gin Gly Cys His Met 

480 485 490 495 

GTG CAC TGT GCC TAC AAC AGC TTG TGG ATG GGG AAT TTT ATT CAG CCG 1594 
Val His Cys Ala Tyr Asn Ser Leu Trp Met Gly Asn Phe He Gin Pro 

500 505 510 

GAT TGG GAC ATG TTC CAG TCC ACT CAC CCT TGT GCC GAA TTC CAT GC 1642 
3 Asp Trp Asp Met Phe Gin Ser Thr His Pro Cys Ala Glu Phe His Ala 

515 520 525 

GCC TCT AGG GCC ATC TCT GGT GGA CCA GTT TAC GTT AGT GAT TGT GTT 1690 
Ala Ser Arg Ala He Ser Gly Gly Pro Val Tyr Val Ser Asp Cys Val 
530 535 540 

40 GGA AAG CAC AAC TTC AAG TTG CTC AAG AGC CTC GCT TTG CCT GAT GGG 1738 

Gly Lys His Asn Phe Lys Leu Leu Lys Ser Leu Ala Leu Pro Asp Gly 

545 550 555 

ACG ATT TTG CGT TGT CAA CAC TAT GCA CTC CCC ACA CGA GAC TGT TTG 1786 
Thr He Leu Arg Cys Gin His Tyr Ala Leu Pro Thr Arg Asp. Cys Leu 
560 565 570 575 

TTT GAA GAC CCC TTG CAT GAT GGG AAG ACA ATG CTC AAA ATT TGG AAT 1834 
Phe Glu Asp Pro Leu His Asp Gly Lys Thr Met Leu Lys He Trp Asn 

580 585 590 

CTC AAC AAA TAT ACA GGT GTT TTG GGT CTA TTT AAT TGC CAA GGA GGT 1882 
Leu Asn Lys Tyr Thr Gly Val Leu Gly Leu Phe Asn Cys Gin Gly Gly 
50 5 9 5 6 0 0 6 05 

GGG TGG TGT CCC GTA ACT AGG AGA AAC AAG AGT GCC TCT GAA TTT TCA 1930 
Gly Trp Cys Pro Val Thr Arg Arg Asn Lys Ser Ala Ser Glu Phe Ser 
610 615 620 
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CAA ACT GTG ACA TGC TTA GCG AGT CCT CAA GAC ATT GAA TGG AGC AAT 1978 
Gin Thr Val Thr Cys Leu Ala Ser Pro Gin Asp lie Glu Trp Ser Asn 

625 630 635 

GGG AAA AGC CCA ATA TGC ATA AAA GGG ATG AAT GTG TTT GCT GTA TAT 2026 
Gly Lys Ser Pro lie Cys lie Lys Gly Met Asn Val Phe Ala Val Tyr 
640 645 650 655 

TTG TTC AAG GAC CAC AAA CTA AAG CTC ATG AAG GCA TCA GAG AAA TTG 2074 
Leu Phe Lys Asp His Lys Leu Lys Leu Met Lys Ala Ser Glu Lys Leu 

660 665 670 

GAA GTT TCA CTT GAG CCA TTT ACT TTT GAG CTA TTG ACA GTG TCT CCA 2122 
Glu Val Ser Leu Glu Pro Phe Thr Phe Glu Leu Leu Thr Val Ser Pro 

675 680 685 

GTG ATT GTG CTG TCA AAA AAG TTA ATT CAA TTT GCT CCA ATT GGA TTA 2170 
Val lie Val Leu Ser Lys Lys Leu He Gin Phe Ala Pro He Gly Leu 

690 695 700 

GTG AAC ATG CTT AAC ACT GGT GGT GCC ATT CAG TCC ATG GAG TTT GAC 2218 
Val Asn Met Leu Asn Thr Gly Gly Ala He Gin Ser Met Glu Phe Asp 

705 710 715 

AAC CAC ATA GAT GTG GTC AAA ATT GGG GTT AGG GGT TGT GGG GAG ATG 2266 
Asn His He Asp Val Val Lys He Gly Val Arg Gly Cys Gly Glu Met 
20 72 0 7 2 5 7 3 0 7 35 

AAG GTG TTT GCA TCA GAG AAA CCA GTT AGT TGC AAA CTA GAT GGG GTA 2314 
Lys Val Phe Ala Ser Glu Lys Pro Val Ser Cys Lys Leu Asp Gly Val 

740 745 750 

GTT GTA AAA TTT GAT TAT GAG GAT AAA ATG CTG AGA GTG CAA GTT CCC 2362 
Val Val Lys Phe Asp Tyr Glu Asp Lys Met Leu Arg Val Gin Val Pro 
25 755 760 765 

TGG CCT AGT GCT TCA AAA TTG TCA ATG GTT GAG TTT TTA TTT TGA TCCCT 2412 
Trp Pro Ser Ala Ser Lys Leu Ser Met Val Glu Phe Leu Phe Stop 

770 775 780 

GAAGGTGAAT TTGGGATACT ATGATGTTTG ACTCTCTTTT TAAGTAATAA GAGTCATATT 2472 
on TTTCTGTTGT AAAAAAAAAA AAAAAA 2498 


(1) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 587 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Japanese artichoke (Stachys sieboldii) 
(F) TISSUE TYPE: leaves 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

45 Thr Asn Gly Ser Asp Leu Glu Arg Glu Thr Gin He Val Val Leu Asp 

15 10 15 

Lys Ser Asp Asp Arg Pro Tyr He Val Leu Leu Pro Leu He Glu Gly 

20 25 30 

Gin Phe Arg Ala Ser Leu Gin Pro Gly Val Asp Asp Phe He Asp He 
35 40 45 

50 cys Val Glu Ser Gly Ser Thr Lys Val Asn Glu Ser Ser Phe Arg Ala 

50 55 60 

Ser Leu Tyr Met His Ala Gly Asp Asp Pro Phe Thr Leu Val Lys Asp 
65 70 75 80 


55 
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Ala 

Val 

Lys 

Val 

Ala 

Arg 

His 

His 

Leu 

Gly Thr 

Phe 

Arg 

Leu 

Leu Glu 





85 





90 




95 

Glu 

Lys 

Thr 

Pro 

Pro 

Gly 

He 

Val 

ASp 

Lys Phe 

Gly 

Trp 

Cys 

Thr Trp 




100 





105 




110 


Asp 

Ala 

Phe 

Tyr 

Leu 

Asn 

Val 

Gin 

Pro 

His Gly Val 

Met 

Glu 

Gly Val 



115 





120 




125 



Gin 

Gly 

Leu 

Val 

Asp 

Gly 

Gly 

Cys 

Pro 

Pro Gly 

Leu 

Val 

Leu 

He Asp 


130 





135 




140 




Asp 

Gly 

Trp 

Gin 

Ser 

He 

Cys 

His 

Asp 

Asn Asp 

Ala 

Leu 

Thr 

Thr Glu 

145 





150 




155 




160 

Gly 

Met 

Gly 

Arg 

Thr 

Ser 

Ala 

Gly 

Glu 

Gin Met 

Pro 

Cys 

Arg 

Leu He 





165 





170 




175 

Lys 

Phe 

Glu 

Glu 

Asn 

Tyr 

Lys 

Phe 

Arg 

Glu Tyr 

Glu 

Ser 

Pro 

Asn Lys 




180 





185 




190 


Thr 

Gly 

Pro 

Gly 

Pro 

Asn 

Thr 

Gly 

Met 

Gly Ala 

Phe 

lie 

Arg 

Asp Met 



195 





200 




205 



Lys 

Asp 

Asn 

Phe 

Lys 

Ser 

Val 

Asp 

Tyr 

Val Tyr 

Val 

Trp 

His 

Ala Leu 


210 





215 




220 




Cys 

Gly 

Tyr 

Trp 

Gly 

Gly 

Leu 

Arg 

Pro 

Asn Val 

Pro 

Gly 

Leu 

Pro Glu 

225 





230 




235 




240 

Ala 

Lys 

Leu 

lie 

Glu 

Pro 

Lys 

Leu 

Thr 

Pro Gly Leu 

Lys 

Thr 

Thr Met 





245 





250 




255 

Glu 

Asp 

Leu 

Ala 

Val 

Asp 

Lys 

He 

Val 

Asn Asn Gly 

Val 

Gly 

Leu Val 




260 





265 




270 


Pro 

Pro 

Glu 

Phe 

Val 

Glu 

Gin 

Met 

Tyr 

Glu Gly Leu 

His 

Ser 

His Leu 



275 





280 




285 



Glu 

Ser 

Val 

Gly 

He 

Asp 

Gly 

Val 

Lys 

Val Asp 

Val 

He 

His 

Leu Leu 


290 





295 




300 




Glu 

Met 

Leu 

cys 

Glu 

Asp 

Tyr 

Gly 

Gly Arg Val Asp 

Leu 

Ala 

Lys Ala 

305 





310 




315 




320 

Tvr 

Tyr 

Lys 

Ala 

Leu 

Ser 

Ser 

Ser 

val 

Asn Asn 

His 

Phe 

Asn 

Gly Asn 




325 





330 




335 

Gly 

Val 

lie 

Ala 

Gly 

Leu 

Glu 

His 

Cys 

Asn Asp 

Phe 

Met 

Phe 

Leu Gly 




340 





345 




350 


Thr 

Glu 

Ala 

He 

Thr 

Leu 

Gly 

Arg 

Val 

Gly Asp Asp 

Phe 

Trp 

Cys Thr 



355 





360 




365 



Asp 

Pro 

Ser 

Gly 

Asp 

Pro 

Asn 

Gly 

Thr 

Phe Trp 

Leu 

Gin 

Gly 

Cys His 


370 





375 




380 




Met 

Val 

His 

Cys 

Ala 

Tyr 

Asn 

Ser 

He 

Trp Met 

Gly 

Asn 

Phe 

He His 

385 





390 




395 




400 

Pro 

Asp 

Trp 

Asp 

Met 

Phe 

Gin 

Ser 

Thr 

His Pro 

Cys 

Ala 

Glu 

Phe His 





405 





410 




415 

Ala 

Ala 

Ser 

Arg 

Ala 

He 

Ser 

Gly 

Gly 

Pro He 

Tyr 

Val 

Ser 

Asp Ser 




420 





425 




430 


val 

Gly 

Lys 

His 

Asn 

Phe 

Glu 

Leu 

Leu Arg Ser 

Leu 

Val 

Leu 

Pro Asp 



435 





440 




445 



Gly 

Ser 

He 

Leu 

Arg 

Cys 

ASp 

Tyr 

Tyr 

Ala Leu 

Pro 

Thr 

Arg 

Asp Cys 


450 





455 




460 




Leu 

Phe 

Glu 

Asp 

Pro 

Leu 

His 

Asn 

Gly Lys Thr 

Met 

Leu 

Lys 

He Trp 

465 





470 




475 




480 

Asn 

Tyr 

Asn 

Lys 

Phe 

Thr 

Gly 

Val 

Val 

Gly Thr 

Phe 

Asn 

Cys 

Gin Gly 





485 





490 




495 

Gly 

Gly 

Trp 

Ser 

Arg 

Glu 

Val 

Arg 

Arg Asn Gin 

Cys 

Ala 

Ala 

Glu Tyr 




500 





505 




510 


Ser 

His 

Ala 

Val 

Ser 

Ser 

Ser 

Ala 

Gly 

Pro Ser 

Asp 

He 

Glu 

Trp Lys 



515 





520 




525 



Gin 

Gly 

Thr 

Ser 

Pro 

He 

Asp 

val 

Asp Gly val 

Lys 

Thr 

Phe 

Ala Leu 


530 





535 




540 





35 
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Tyr Leu Phe His Glu Lys Lys Leu Val Leu Ser Lys Pro Ser Asp Lys 
545 550 555 560 

lie Asp lie Thr Leu Glu Pro Phe Asp Phe Glu Leu lie Thr Val Ser 

565 570 575 

Pro Val Lys Thr Leu Ala Asn Cys Thr Val Gin 
580 585 


(1) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 


{ ix) FEATURE : 

(A) NAME/KEY: peptide 
20 (B) LOCATION: 2.. 1762 

<C) IDENTIFICATION METHOD: by experiment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

G ACA AAC GGG TCG GAT CTT GAG CGG GAA ACT CAA ATA GTC GTG CTC 46 
25 Thr Asn Gly Ser Asp Leu Glu Arg Glu Thr Gin He Val Val Leu 

15 10 15 

GAC AAG TCC GAC GAC AGG CCC TAC ATC GTG CTG CTT CCG CTC ATC GAG 94 
Asp Lys Ser Asp Asp Arg Pro Tyr He Val Leu Leu Pro Leu He Glu 

20 25 30 

GGG CAG TTT CGG GCT TCC CTT CAG CCC GGT GTG GAT GAT TTT ATC GAT 142 
Gly Gin Phe Arg Ala Ser Leu Gin Pro Gly Val Asp Asp Phe He Asp 

35 40 45 

ATT TGT GTC GAA AGC GGG TCA ACC AAG GTC AAC GAG TCC TCG TTC CGT 190 
He Cys Val Glu Ser Gly Ser Thr Lys Val Asn Glu Ser Ser Phe Arg 

50 55 60 

GCT TCG CTC TAC ATG CAC GCC GGT GAT GAC CCT TTT ACC CTG GTG AAG 238 
Ala Ser Leu Tyr Met His Ala Gly Asp Asp Pro Phe Thr Leu Val Lys 

65 70 75 

GAC GCC GTG AAG GTG GCG CGC CAC CAC CTC GGG ACG TTC AGG CTG CTG 286 
Asp Ala Val Lys Val Ala Arg His His Leu Gly Thr Phe Arg Leu Leu 
80 85 90 95 

40 GAG GAG AAA ACT CCG CCG GGG ATC GTC GAC AAA TTC GGG TGG TGC ACG 334 

Glu Glu Lys Thr Pro Pro Gly He Val Asp Lys Phe Gly Trp Cys Thr 

100 105 110 

TGG GAT GCG TTC TAC CTC AAC GTC CAG CCC CAC GGC GTT ATG GAG GGC 382 
Trp Asp Ala Phe Tyr Leu Asn Val Gin Pro His Gly Val Met Glu Gly 
115 120 125 

45 GTG CAG GGG CTG GTT GAC GGC GGA TGT CCG CCG GGG CTG GTG TTG ATC 430 

Val Gin Gly Leu Val Asp Gly Gly Cys Pro Pro Gly Leu Val Leu He 

130 135 140 

GAC GAC GGG TGG CAG TCC ATT TGT CAC GAC AAC GAC GCG CTC ACC ACC 478 
Asp Asp Gly Trp Gin Ser He Cys His Asp Asn Asp Ala Leu Thr Thr 
SQ 145 150 155 

GAG GGG ATG GGG AGA ACC TCC GCC GGA GAG CAA ATG CCC TGC AGG TTG 526 
Glu Gly Met Gly Arg Thr Ser Ala Gly Glu Gin Met Pro Cys Arg Leu 
160 165 170 175 
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ATC AAG TTT GAG GAG AAT TAC AAG TTC AGG GAG TAC GAG AGC CCG AAT 574 
lie Lys Phe Glu Glu Asn Tyr Lys Phe Arg Glu Tyr Glu Ser Pro Asn 

180 185 190 

AAA ACT GGG CCG GGC CCG AAT ACG GGG ATG GGG GCC TTT ATT CGT GAC 622 
Lys Thr Gly Pro Gly Pro Asn Thr Gly Met Gly Ala Phe lie Arg Asp 

195 200 205 

ATG AAG GAC AAT TTC AAG AGT GTG GAC TAC GTG TAC GTG TGG CAT GCG 670 
Met Lys Asp Asn Phe Lys Ser Val Asp Tyr Val Tyr Val Trp His Ala 

210 215 220 

TTG TGT GGT TAT TGG GGC GGG CTC AGG CCC AAT GTT CCG GGC CTG CCC 718 
Leu Cys Gly Tyr Trp Gly Gly Leu Arg Pro Asn Val Pro Gly Leu Pro 

225 230 235 

GAG GCT AAG CTC ATT GAG CCC AAA CTG ACT CCT GGG CTT AAG ACC ACC 766 
Glu Ala Lys Leu lie Glu Pro Lys Leu Thr Pro Gly Leu Lys Thr Thr 
15 240 245 250 255 

ATG GAA GAT TTG GCT GTT GAT AAG ATT GTC AAC AAT GGC GTG GGT CTG 814 
Met Glu Asp Leu Ala Val Asp Lys lie Val Asn Asn Gly Val Gly Leu 

260 265 270 

GTC CCA CCG GAG TTT GTT GAA CAA ATG TAT GAA GGA TTA CAT TCA CAT 862 
Val Pro Pro Glu Phe Val Glu Gin Met Tyr Glu Gly Leu His Ser His 

275 280 285 

CTC GAA TCT GTG GGG ATT GAT GGA GTC AAA GTT GAC GTC ATC CAT TTG 910 
Leu Glu Ser Val Gly lie Asp Gly Val Lys Val Asp Val lie His Leu 

290 295 300 

TTG GAA ATG TTG TGT GAA GAC TAT GGT GGG AGA GTG GAC TTA GCC AAG 958 
25 Leu Glu Met Leu Cys Glu Asp Tyr Gly Gly Arg Val Asp Leu Ala Lys 

305 310 315 

GCT TAT TAC AAG GCC TTA TCA AGC TCA GTT AAC AAC CAC TTC AAC GGC 1006 
Ala Tyr Tyr Lys Ala Leu Ser Ser Ser Val Asn Asn His Phe Asn Gly 
320 325 330 335 

AAC GGC GTC ATC GCT GGC CTG GAG CAC TGC AAT GAC TTC ATG TTT CTC 1054 
30 Asn Gly Val lie Ala Gly Leu Glu His Cys Asn Asp Phe Met Phe Leu 

340 345 350 

GGA ACC GAG GCC ATT ACC TTG GGT CGT GTC GGG GAT GAT TTT TGG TGC 1102 
Gly Thr Glu Ala lie Thr Leu Gly Arg Val Gly Asp Asp Phe Trp Cys 
355 360 365 

35 ACT GAT CCA TCT GGA GAT CCC AAT GGC ACG TTC TGG TTG CAA GGG TGT 1150 

Thr Asp Pro Ser Gly Asp Pro Asn Gly Thr Phe Trp Leu Gin Gly Cys 

370 375 380 

CAC ATG GTG CAC TGC GCC TAC AAC AGC ATA TGG ATG GGT AAT TTC ATC 1198 
His Met Val His Cys Ala Tyr Asn Ser He Trp Met Gly Asn Phe lie 
385 390 395 

*0 CAC CCT GAT TGG GAC ATG TTT CAA TCG ACT CAC CCT TGC GCT GAA TTC 1246 

His Pro Asp Trp Asp Met Phe Gin Ser Thr His Pro Cys Ala Glu Phe 
400 405 410 415 

CAC GCT GCC TCA CGA GCC ATC TCC GGC GGG CCC ATT TAC GTC AGT GAC 1294 
His Ala Ala Ser Arg Ala He Ser Gly Gly Pro He Tyr Val Ser Asp 

420 425 430 

TCG GTC GGA AAG CAC AAC TTC GAG CTC CTT AGG AGC CTC GTT CTT CCC 1342 
Ser Val Gly Lys His Asn Phe Glu Leu Leu Arg Ser Leu Val Leu Pro 

435 440 445 

GAT GGC TCC ATC CTC CGT TGT GAT TAC TAC GCG CTT CCG ACT CGC GAT 13 90 
Asp Gly Ser He Leu Arg Cys Asp Tyr Tyr Ala Leu Pro Thr Arg Asp 
50 450 455 460 

TGC CTC TTT GAA GAT CCA CTT CAC AAT GGC AAG ACT ATG CTC AAA ATT 14 38 
Cys Leu Phe Glu Asp Pro Leu His Asn Gly Lys Thr Met Leu Lys He 
465 470 475 


45 
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TGG AAT TAT AAC AAG TTC ACC GGA GTT GTC GGA ACT TTC AAC TGC CAA 1486 
Trp Asn Tyr Asn Lys Phe Thr Gly Val Val Gly Thr Phe Asn Cys Gin 
480 485 490 495 

5 GGT GGC GGG TGG AGC CGG GAA GTG CGT CGC AAC CAA TGC GCT GCC GAG 1534 

Gly Gly Gly Trp Ser Arg Glu Val Arg Arg Asn Gin Cys Ala Ala Glu 

500 505 510 

TAT TCC CAC GCC GTC TCC TCT AGC GCT GGT CCG AGT GAC ATT GAG TGG 1582 
Tyr Ser His Ala Val Ser Ser Ser Ala Gly Pro Ser Asp lie Glu Trp 
515 520 525 

10 AAG CAA GGA ACG AGT CCG ATC GAC GTC GAC GGC GTC AAA ACA TTC GCG 1630 

Lys Gin Gly Thr Ser Pro lie Asp Val Asp Gly Val Lys Thr Phe Ala 

530 535 540 

TTG TAC CTA TTC CAC GAG AAG AAA CTC GTC CTT TCT AAG CCA TCA GAC 1678 
Leu Tyr Leu Phe His Glu Lys Lys Leu Val Leu Ser Lys Pro Ser Asp 

1S 545 550 555 

AAA ATC GAC ATC ACG CTT GAG CCC TTC GAT TTT GAG CTG ATA ACC GTT 1726 

Lys He Asp He Thr Leu Glu Pro Phe Asp Phe Glu Leu He Thr Val 

560 565 570 575 

TCT CCA GTC AAA ACT CTA GCC AAT TGC ACC GTC CAA 1762 

Ser Pro Val Lys Thr Leu Ala Asn Cys Thr Val Gin 

20 580 585 


30 


(1) INFORMATION FOR SEQ ID NO: 7; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(vi) ORIGINAL SOURCE : 

(A) ORGANISM: corn (Zea mays L.) 

(B) STRAIN: Pioneer 3358 
CF) TISSUE TYPE: leaves 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

35 

Gin Ser Thr His Pro Cys Ala Ala Phe His Ala Ala Ser Arg Ala He 

5 10 15 

Ser Gly Gly Pro He Tyr Val Ser Asp Ser Val Gly Gin His Asp Phe 

20 25 30 

Ala Leu Leu Arg Arg Leu Ala Leu Pro Asp Gly Thr Val Leu Arg Cys 
40 35 40 45 

Glu Gly His Ala Leu Pro Thr Arg Asp Cys Leu Phe Ala Asp Pro Leu 

50 55 60 

His Asp Gly Arg Thr Val Leu Lys He Trp Asn Val Asn Arg Phe Ala 
65 70 75 80 

Gly Val val Gly Ala Phe Asn Cys Gin Gly Gly Gly Trp Ser Pro Glu 
45 8 5 9 0 9 5 

Ala Arg Arg Asn Lys Cys Phe Ser Glu Phe Ser Val Pro Leu Ala Ala 

100 105 110 

Arg Ala Ser Pro Ser Asp Val Glu Trp Lys Ser Gly Lys Ala Gly Pro 

115 120 125 

Gly Val Ser Val Lys Asp Val Ser Gin Phe Ala Val Tyr Ala Val Glu 

130 135 140 

Ala Arg Thr Leu Gin Leu Leu Arg Pro Asp Glu Gly Val Asp Leu Thr 
145 150 155 160 


50 
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is 


40 


Leu Gin Pro Phe Thr Tyr Glu Leu Phe Val Val Ala Pro Val Arg Val 

165 170 175 

lie Ser His Glu Arg Ala lie Lys Phe Ala Pro He Gly Leu Ala Asn 

180 185 190 

Met Leu Asn Thr Ala Gly Ala Val Gin Ala Phe Glu Ala Lys Lys Asp 

195 200 205 

Ala Ser Gly Val Thr Ala Glu Val Phe Val Lys Gly Ala Gly Glu Leu 

210 215 220 

Val Ala Tyr Ser Ser Ala Thr Pro Arg Leu Cys Lys Val Asn Gly Asp 
225 230 235 240 

Glu Ala Glu Phe Thr Tyr Lys Asp Gly Val Val Thr Val Asp Val Pro 

245 250 255 

Trp Ser Gly Ser Ser Ser Lys Leu Cys Cys Val Gin Tyr Val Tyr Stop 
260 265 270 


(1) INFORMATION FOR SEQ ID N0:8: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 996 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA to mRNA 

(ix) FEATURE: 

(A) NAME/ KEY: peptide 

(B) LOCATION: 2.. 817 

(C) IDENTIFICATION METHOD: by experiment 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 8: 

C CAG TCC ACG CAC CCC TGC GCC GCC TTC CAC GCC GCG TCC CGC GCC 46 
Gin Ser Thr His Pro Cys Ala Ala Phe His Ala Ala Ser Arg Ala 
5 10 15 

ATC TCC GGC GGG CCC ATC TAC GTC AGC GAC TCG GTG GGG CAG CAC GAC 94 
He Ser Gly Gly Pro lie Tyr Val Ser Asp Ser Val Gly Gin His Asp 

20 25 30 

TTC GCG CTG CTC CGC CGC CTG GCG CTC CCC GAC GGC ACC GTC CTC CGG 142 
Phe Ala Leu Leu Arg Arg Leu Ala Leu Pro Asp Gly Thr Val Leu A 

35 40 45 

TGC GAG GGC CAC GCG CTG CCC ACG CGC GAC TGC CTC TTC GCC GAC CCG 190 
Cys Glu Gly His Ala Leu Pro Thr Arg Asp Cys Leu Phe Ala Asp Pro 

50 55 60 

CTC CAC GAC GGC CGG ACC GTG CTC AAG ATC TGG AAC GTG AAC CGC TTC 238 
Leu His Asp Gly Arg Thr Val Leu Lys He Trp Asn Val Asn Arg Phe 
65 70 75 

45 GCC GGC GTC GTC GGC GCC TTC AAC TGC CAG GGC GGC GGG TGG AGC CCC 286 

Ala Gly Val Val Gly Ala Phe Asn Cys Gin Gly Gly Gly Trp Ser Pro 
80 85 90 95 

GAG GCG CGG CGG AAC AAG TGC TTC TCG GAG TTC TCC GTG CCC CTG GCC 3 34 

Glu Ala Arg Arg Asn Lys Cys Phe Ser Glu Phe Ser Val Pro Leu Ala 

100 105 110 

GCG CGC GCC TCG CCG TCC GAC GTC GAG TGG AAG AGC GGC AAG GCG GGG 382 
Ala Arg Ala Ser Pro Ser Asp Val Glu Trp Lys Ser Gly Lys Ala Gly 
115 120 125 


SO 
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CCA GGC GTC AGC GTC AAG GAC GTC TCC CAG TTC GCC GTG TAC GCG GTC 43 0 

Pro Gly Val Ser Val Lys Asp Val Ser GXn Phe Ala Val Tyr Ala Val 

130 135 140 

GAG GCC AGG ACG CTG CAG CTG CTG CGC CCC GAC GAG GGC GTC GAC CTC 478 
Glu Ala Arg Thr Leu Gin Leu Leu Arg Pro Asp Glu Gly Val Asp Leu 

145 150 155 

ACG CTG CAG CCC TTC ACC TAC GAG CTC TTC GTC GTT GCC CCC GTG CGC 526 
Thr Leu Gin Pro Phe Thr Tyr Glu Leu Phe Val Val Ala Pro Val Arg 
160 165 170 175 

GTC ATC TCG CAT GAG CGG GCC ATC AAG TTC GCG CCC ATC GGA CTC GCC 574 
Val lie Ser His Glu Arg Ala He Lys Phe Ala Pro lie Gly Leu Ala 

180 185 190 

AAC ATG CTC AAC ACC GCC GGC GCC GTG CAG GCG TTC GAG GCC AAG AAA 622 
Asn Met Leu Asn Thr Ala Gly Ala Val Gin Ala Phe Glu Ala Lys Lys 
1S 195 200 205 

GAT GCT AGC GGC GTC ACG GCA GAG GTG TTC GTG AAG GGC GCA GGG GAG 670 
Asp Ala Ser Gly Val Thr Ala Glu Val Phe Val Lys Gly Ala Gly Glu 

210 215 220 

CTG GTG GCG TAC TCG TCG GCG ACG CCC AGG CTC TGC AAG GTG AAC GGC 718 
Leu Val Ala Tyr Ser Ser Ala Thr Pro Arg Leu Cys Lys Val Asn Gly 

225 230 235 

GAC GAG GCC GAG TTC ACG TAC AAG GAC GGC GTG GTC ACC GTC GAC GTG 766 
Asp Glu Ala Glu Phe Thr Tyr Lys Asp Gly Val Val Thr Val Asp Val 
240 245 250 255 

CCG TGG TCG GGG TCG TCG TCG AAG CTG TGT TGC GTC CAG TAC GTC TAC 814 
25 Pro Trp Ser Gly Ser Ser Ser Lys Leu Cys Cys Val Gin Tyr Val Tyr 

260 265 270 

TGA GCCGGACGGG CCGATGACTC TGCGTCTCTG CTCCCTGCTG GCCTGCTCAG GAC 873 
Stop 

ATAATCTAAT GTTTAGAGCT TACCAGGTTT TACAGCTCTA TCAGTTTACT TTTGTTTTTC 933 
TGCTCTTCGT TTTTTAAGAA TTATTTCTAT TGTGTGAATT AATGAGTGCT TTCCTTCTAA 993 
AAA 996 


20 


30 


35 

Claims 

1. A nucleic acid molecule encoding a plant raffinose synthetase, said synthetase being capable of producing raffi- 
nose by combining a D-galactosyl group through an a(1->6) bond with a hydroxyl group attached to the carbon 

40 atom at position 6 of a D -glucose residue in a sucrose molecule, said nucleic acid molecule being selected from 
the group consisting of: 

(a) nucleic acid molecules encoding a protein with the amino acid sequence selected from the group consisting 
of SEQ ID NO:1, SEQ ID NO:3, SEQ ID N0:5, and SEQ ID NO:7; 
45 (b) nucleic acid molecules comprising the coding region of the nucleotide sequence selected from the group 

consisting of SEQ ID NO:2. SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8; 

(c) nucleic acid molecules hybridizing to the complementary strand of a nucleic acid molecule indicated under 
(a)or(b); 

(d) nucleic acid molecules the sequence of which differs from the sequence of a nucleic acid molecule of any 
so one of (a) to (c) due to the degeneracy of the genetic code; and 

(e) a fragment of any of the nucleic acid molecules of any one of (a) to (d). 

2. The nucleic acid molecule encoding a plant raffinose synthetase according to daim 1 , wherein the plant is a dico- 
teiydon, preferably a leguminous plant, preferably a broad bean or soybean, or a lamiaceous plant, preferably a 

55 Japanese artichoke, or a monocotyledon, preferably a gramineous plant, preferably corn. 

3. A probe or primer that specifically hybridizes to a nucleic acid molecul of claim 1 or 2. 


40 
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4. The probe or primer of claim 3 that has a length of at least 1 5, 1 7, 21 or 50 nucleotides. 


5. A chimeric gene comprising th nucleic acid molecule of claim 1 or 2, or the probe or primer of claim 3 or 4. 

5 6. The chimeric gene of claim 5 functionally linked to a promoter. 

7. A plasmid comprising a nucleic acid molecule of claim 1 or 2, optionally in functional combination with a promoter, 
or a chimeric gene of claim 5 or 6. 

10 8. A host organism containing the chimeric gene of claim 5 or 6, or the plasmid of claim 7. 

9. The host organism of claim 8 which is a microorganism, a plant cell or a plant. 

1 0. A method for the production of a raff inose synthetase protein or a portion thereof, comprising the steps of isolating 
15 and purifying a raffinose synthetase protein or a portion thereof from a culture obtained by cultivating the host 

organism of claim 8 or 9. 

11. A raffinose synthetase protein encoded by the nucleic add molecule of claim 1 or 2 or obtained by the method of 
claim 10. 

20 

12. An antisense nucleic acid molecule or a ribozyme specifically hybridizing or binding to a nucleic acid molecule of 
claim 1 or 2. 

13. A method for modifying metabolism of a derived host organism, which comprises introducing the nucleic acid mol- 
25 ecule, probe, primer or chimeric gene of any one of claims 1 to 6 or an antisense nucleic acid molecule or ribozyme 

of claim 12 into said host organism or a cell thereof, so that the content of the raffinose family oligosaccharides in 
said host organism or cell thereof is changed. 

14. A raffinose synthetase protein related to the raffinose synthetase protein of claim 11 by deletion, replacement, 
30 modification or addition of one or several amino acids. 

15. An anti-raff inose synthetase antibody capable of binding to the raffinose synthetase protein of claim 1 1 or 14. 
16. . Use of the anti-raff inose synthetase antibody of claim 15 for the detection of a raffinose synthetase protein. 

35 

17. A method for the detection of a raffinose synthetase gene or a fragment thereof which comprises hybridizing a 
probe of claim 3 or 4 to an organism-derived genomic DNA or cDNA fragment, and detecting the presence of 
hybridization. 

40 18. A method for the amplification of a raffinose synthetase gene or a part thereof, which comprises annealing the 
primer of claim 3 or 4 to organism-derived genomic DNA or cDNA, and amplifying the resulting DNA fragment 

19. The method of claim 18 wherein the DNA fragment is amplified by polymerase chain reaction (PCR). 

45 20. The method of any one of claims 1 7 to 1 9, wherein the organism is a plant. 

21 . A method for obtaining a raffinose synthetase gene, comprising the steps of identifying a DNA fragment containing 
a raffinose synthetase gene or a gene fragment thereof by the method of any one of claims 17 to 20, and isolating 
and purifying the DNA fragment identified. 

so 

22. A raffinose synthetase gene obtained by the method of claim 21 . 
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